So does it mean there is still hope for encoder-decoders, and that it lies in the small model size range? But aren't they harder to fine-tune (or at least less convenient)? Or should we treat this more as interesting research?
For now, it's very interesting research. I don't see why someone would use them in production.
Fine-tuning them for sequence-transformation tasks, such as paraphrasing and translation, could be a good use case, but that remains to be demonstrated.