Lesson 4: Families of Models
The landscape
Section titled “The landscape”There are several major families of models available today. Each has different strengths, pricing, and context-window sizes. The choice depends on the use case.
Major families
Section titled “Major families”Google Gemini
Section titled “Google Gemini”- Models: Gemini Pro, Gemini Ultra, Gemini Flash
- Strengths: Multimodal (text, images, video, audio), large context windows (up to 1M+ tokens), strong reasoning
- Best for: Tasks involving mixed media, long-document analysis
Anthropic Claude
Section titled “Anthropic Claude”- Models: Haiku (fast, cheap), Sonnet (balanced), Opus (most capable)
- Strengths: Strong instruction-following, long context windows (200K tokens), careful and safety-conscious outputs
- Best for: Code generation, analysis tasks, structured outputs
OpenAI GPT
Section titled “OpenAI GPT”- Models: GPT-4, GPT-4o, o1, o3
- Strengths: Broad general knowledge, strong coding ability, large ecosystem of tools and integrations
- Best for: General-purpose tasks, creative writing, code completion
Meta LLaMA
Section titled “Meta LLaMA”- Models: LLaMA 3, LLaMA 4 (open-weight)
- Strengths: Open-weight (can self-host), no API costs, customisable through fine-tuning
- Best for: Organisations that need data sovereignty or offline operation
Mistral
Section titled “Mistral”- Models: Mistral Large, Mistral Medium, Codestral
- Strengths: European-made, open-weight options, competitive performance at lower cost
- Best for: European regulatory compliance, cost-sensitive deployments
How to choose
Section titled “How to choose”| Factor | Question to ask |
|---|---|
| Task type | Is this code, text, or multimodal? |
| Context size | How much input does the model need to process? |
| Cost | What is the per-token price? What is the expected volume? |
| Privacy | Can data leave your infrastructure? |
| Speed | Is latency critical for this use case? |
| Accuracy | How important is factual precision vs. creative output? |
ReArch is model-agnostic — you can configure different models for different agents and switch between providers without changing your workflows.