A Deep Dive into Transformers with TensorFlow and Keras: Part 2