01What an AI model is
In AI, a model is a mathematical system that has been trained to perform a specific task. The training process involves exposing the model to large amounts of data and adjusting its internal parameters (billions of numbers) until it performs the task well on the training data and generalises to new examples it has not seen before.
A language model is trained to predict and generate language. An image classification model is trained to identify what is in images. A recommendation model is trained to predict what content or products a user might engage with. In each case, the 'model' is the trained mathematical system that performs the task, encapsulated in a set of parameters that can be saved, loaded, and run on computer hardware.
02Why different models behave differently
Two AI models trained for the same task but trained on different data, with different architectures, or with different training objectives will behave differently. This is why GPT-4, Claude, and Gemini give different responses to the same question, even though all three are large language models designed for similar purposes.
The differences reflect choices made during training: what data was included, what values were embedded through reinforcement learning from human feedback, what safety constraints were applied, and what tasks the training emphasised. These choices produce models with different capabilities, different tendencies, and different failure modes.
03What this means for business decisions
Understanding that different models have different characteristics helps you make better AI procurement and deployment decisions.
Different models are better at different tasks. Claude has been noted for strong performance on long document analysis and following nuanced instructions. GPT-4 has been noted for coding ability and broad general knowledge. Gemini has been noted for multimodal tasks. These are generalisations that change as models are updated, but the principle holds: choose your model based on what your use case requires, not just on vendor familiarity.
Different models have different safety and reliability characteristics. Anthropic designs Claude with safety as a primary design objective. OpenAI applies specific safety training to GPT-4. Google applies its own responsible AI standards to Gemini. These differences have practical implications for regulated use cases where the AI's reliability and refusal to produce harmful outputs is important.
Model versions matter: GPT-4 and GPT-3.5 are different models with significantly different capabilities. Claude 3 Opus and Claude 3 Haiku are different models with different performance and cost characteristics. When evaluating AI performance or cost, specifying the exact model version is important for comparable assessment.
Key Takeaways
- 1.An AI model is a mathematical system trained on data to perform a task; it is encoded in billions of numerical parameters that can be saved, shared, and run.
- 2.Different models trained for similar tasks behave differently because of differences in training data, architecture, safety training, and design objectives.
- 3.Model choice should be based on task fit: different models have relative strengths in long document analysis, coding, multimodal tasks, or following complex instructions.
- 4.Safety and reliability characteristics differ between models and providers; these differences matter for regulated or high-stakes use cases.
- 5.Model versions within a family (GPT-4 vs GPT-3.5, Claude Opus vs Haiku) differ significantly in capability and cost; specify version when comparing.
References & Further Reading
- [1]Google: Understanding AI ModelsGoogle AI
Want to discuss this with an expert?
Book a strategy call to explore how these insights apply to your organisation.
Book a Strategy Call