Following the ReLU activation, we apply Dropout with a probability of 0.5. Note that dropout is not mentioned in the architecture table in the paper.
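To make the ReLU-then-Dropout ordering concrete, here is a minimal sketch in plain Python. The function names `relu` and `dropout` and the sample input are made up for illustration and are not taken from the paper. It also assumes the "inverted" dropout variant, where surviving activations are scaled by 1 / (1 - p) during training, which is how common deep learning frameworks behave.

```python
import random


def relu(xs):
    """Apply ReLU element-wise: negative values become 0."""
    return [max(0.0, x) for x in xs]


def dropout(xs, p=0.5, training=True, rng=random):
    """Inverted dropout (assumed variant): zero each value with probability p
    and scale the survivors by 1 / (1 - p) so the expected value is unchanged."""
    if not 0.0 <= p < 1.0:
        raise ValueError("p must be in [0, 1)")
    if not training or p == 0.0:
        # At inference time dropout is a no-op.
        return list(xs)
    keep = 1.0 - p
    return [x / keep if rng.random() >= p else 0.0 for x in xs]


# Hypothetical activations, only for illustration.
rng = random.Random(0)
activations = relu([-1.0, 2.0, 0.5, 3.0])
dropped = dropout(activations, p=0.5, rng=rng)
print(activations)
print(dropped)
```

With p = 0.5, every value in `dropped` is either 0 or exactly twice the corresponding ReLU output. Calling `dropout(..., training=False)` returns the activations unchanged, which is what you want at inference time.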
In the paper, the authors introduced not one