Back to Glossary

Large Language Model (LLM)

A neural network trained on massive text datasets that can understand and generate human language.

Large language models (LLMs) are deep learning systems trained on billions of words of text data. They learn statistical patterns in language and can generate coherent text, answer questions, write code, translate languages, and perform a wide range of language tasks.

Models like GPT-4, Claude, LLaMA, and Gemini represent the current state of the art. They use the transformer architecture and are trained using self-supervised learning on internet-scale data, then refined through techniques like reinforcement learning from human feedback (RLHF).

LLMs have created entirely new job categories and transformed existing ones. Engineers who can build applications on top of LLMs, fine-tune them for specific domains, or evaluate their outputs are in high demand across the AI industry.

Related AI Job Categories

    Large Language Model (LLM) — AI Careers Glossary | We Love AI Jobs