Years ago, I found this great article from Ars Technica, "A jargon-free explanation of how AI large language models work" that lives up to its name. Even though it describes models from July 2023 (and they have evolved greatly in the intervening three years), the article nicely bridges the gap between the simplistic “it’s just word prediction” explanation and one needing a computer science degree to understand.