感觉这个设计的前端做得很糟糕。
aclk period 1000ps: 1GHz, quite fast.
*/w_out_fifo/rptr_reg_*: This is probably FIFO read pointer
From FIFO read pointer to axi_wdata output, there is no flop, no pipeline stage. For high speed design, this is crazy.
During place_opt_design, Innovus struggled to fix maxCap/maxTrans/maxFanout while trying to meet set_output_delay (600ps out of 1000ps, 12% clock uncertainty, plus derate, that's very tight). That may be why Innovus inserted large inverters.
Register axi_wdata may fix the problem.
Quality of front end design looks poor. It almost seems that the front end designer has no experience in high speed logic design. Just my 2 cents.