T5Gemma 2 follows the same adaptation idea introduced in T5Gemma: initialize an encoder-decoder model from a decoder-only checkpoint, then adapt it with UL2. In the figure above, the research team show ...
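As a rough illustration of the initialization step only, the following minimal sketch copies the weights of a decoder-only checkpoint into both stacks of an encoder-decoder model. It is not the T5Gemma 2 implementation: the function name `init_encoder_decoder`, the dictionary layout of the checkpoint, and the choice to seed cross-attention from the self-attention weights are all assumptions made for this example, and the subsequent UL2 adaptation (continued training) is not shown.

```python
import copy


def init_encoder_decoder(decoder_only_params, seed_cross_attn=True):
    """Build encoder and decoder parameter dicts from a decoder-only checkpoint.

    Hypothetical layout: {layer_name: {"self_attn": ..., "ffn": ...}}.
    """
    encoder, decoder = {}, {}
    for name, layer in decoder_only_params.items():
        # The encoder reuses the pretrained layer weights as-is.
        encoder[name] = copy.deepcopy(layer)

        # The decoder also starts from the same pretrained layer.
        dec_layer = copy.deepcopy(layer)
        # A decoder-only checkpoint has no cross-attention weights.
        # Assumption for this sketch: seed them from self-attention,
        # otherwise leave them unset for fresh initialization.
        dec_layer["cross_attn"] = (
            copy.deepcopy(layer["self_attn"]) if seed_cross_attn else None
        )
        decoder[name] = dec_layer
    return {"encoder": encoder, "decoder": decoder}


# Toy checkpoint with made-up values, standing in for real tensors.
checkpoint = {
    "layer_0": {"self_attn": [0.1, 0.2], "ffn": [0.3]},
    "layer_1": {"self_attn": [0.4, 0.5], "ffn": [0.6]},
}
model = init_encoder_decoder(checkpoint)
```

Both stacks start from the same pretrained weights, so adaptation training only has to teach the model to use them in an encoder-decoder arrangement rather than learn them from scratch.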
Adjust the root paths in train_donut.py to match your setup. Data is downloaded and preprocessed automatically on the first run, which will take some time.