A→Z
A2ZAI
Back to Glossary
models

Large Language Model (LLM)

AI models trained on massive text datasets that can understand and generate human-like text.

Share:

Definition

Large Language Models are neural networks trained on enormous amounts of text data. They learn patterns in language and can generate coherent, contextually relevant text.

Key Characteristics: - Billions of parameters (GPT-4 has ~1.7 trillion) - Trained on internet-scale text data - Can perform many tasks without task-specific training (zero-shot learning) - Use transformer architecture

Capabilities: - Text generation and completion - Question answering - Summarization - Translation - Code generation

Examples

GPT-4, Claude, Gemini, Llama 3, Mistral are all LLMs.

Want more AI knowledge?

Get bite-sized AI concepts delivered to your inbox.

Free daily digest. No spam, unsubscribe anytime.

Discussion