This pertains to the example in the arch reference manual, 4.1 Memory Address Map.
When making an on-chip read/write to a core not your immediate neighbour It is not apparent which bus is going to be used, I think I have it worked out but want to make sure:
cMesh is for write ops on die, rMesh for read ops on die and xMesh is solely used for off die writes? Or is cMesh only used for N/E/S/W neighbours and xMesh is for on and off-die ops?
My question is how the arbiter handles read/write requests. The read latencies seem to be 8x the write latencies and the write is calculated at 1.5 ops/cycle. So if I have a single operation and its going to a node 6 north and 6 east then it will take 8 cycles for the value to appear in the destinations memory as their is zero overhead to load the data on the network? Similarly I imagine that the arbiter is able to load the next set of values onto the network after one cycle as the first one is already 1.5 away?
What is the reason for the 8x read latency? I understand you need to send the request first but 8 times seems a bit much.