A machine learning or AI model used within the product, its version, provider, capabilities, and costs.
An AI model is the trained machine-learningLearningValidationAn insight gained from an experimentView reference → artefact a product depends on to generate predictions, text, images, or decisionsDecisionStrategyA recorded decision with context, rationale, and consequencesView reference →. It behaves like a versioned dependencyDependencyTeam & OrganisationA cross-team or system dependencyView reference → with a temperament: its weights encode statistical behaviour that shifts with each retrain, so the same prompt can return different answers across versions. Treating it as a tracked product component, with documented capabilitiesCapabilityStrategyAn ability that enables value deliveryView reference → and known limits, is what separates a controlled deploymentDeploymentEngineeringA deployment eventView reference → from a black box.
The discipline of documenting a model as a reportable artefact crystallised with Model Cards for Model Reporting, introduced by Margaret Mitchell and colleagues (including Timnit Gebru) at the 2019 ACM Conference on Fairness, Accountability, and Transparency. A model card records intended use, training data, evaluation metricsMetricStrategyA unified metric that measures progress, health, or behaviour across the productView reference → broken down by group, and known failure conditions. The format has since been adopted by Hugging Face, Google, and most foundation-model providers as the default way to ship a model with its provenance attached.
The foundation-versus-fine-tuned split followed the rise of large pre-trained models. A foundation model is trained once at scale; teams then fine-tune or prompt it for a specific taskTaskProduct SpecificationA unit of work within a story or epicView reference →. That changes the dependency graph: a product may rely on a vendor's base model and a separate fine-tune layer, each with its own version and eval history.
A support product routes tickets through claude-sonnet-4 for triage. The team pins the version, records its evaluation score against a 500-ticket benchmark (87 per cent correct routing), and tracks cost per thousand tokens. When the vendor releasesReleaseProduct SpecificationA shipped version of the productView reference → a newer checkpoint, they re-run the same benchmark before switching: the new model scores 91 per cent on routing but regresses on multilingual tickets, so they hold the upgrade for non-English queues. The model card makes that decision auditable.
In the Unified Product Graph, an AI model lives in the AI and intelligence region as a first-class dependency. A product reaches it through Productpowered byAI Modelhierarchy; its behaviour is shaped by product_powered_by_ai_modelAI Modelprompted viaPrompt Versionhierarchy, measured by ai_model_prompted_via_prompt_versionAI Modelbenchmarked byEval Benchmarkhierarchy, and metered by ai_model_benchmarked_by_eval_benchmarkAI Modelcosted byAI Cost Trackerhierarchy. Modelling the version, eval, and cost as distinct edges means an upgrade decision can be traced end to end: which prompts break, which benchmark moved, and what the new spend looks like.ai_model_costed_by_ai_cost_tracker
Type-specific fields on BaseNode
model_providerstringProvider or vendor
model_idstringUnique model identifier (e.g. "claude-sonnet-4-20250514")
model_versionstringSpecific version
model_purposestringIntended use case
cost_per_1k_tokensnumberCost per 1,000 tokens
context_windownumberMaximum context window (tokens)
latency_p50_msnumberMedian latency (p50, ms)
latency_p99_msnumberTail latency (p99, ms)
input_schemastringExpected input format or schema
output_schemastringExpected output format or schema
aliasesstring[]Alternative names
tagsstring[]Free-form classification tags
idstringrequiredUnique identifier (UUID)
typeNodeTyperequiredDiscriminator for the entity type
titlestringrequiredDisplay name
descriptionstringOptional detailed description
statusstringLifecycle status
tagsstring[]Freeform tags for filtering
5 phases — initial: evaluating
13 edge types connected to this entity.
product_powered_by_ai_modelai_model_prompted_via_prompt_versionai_model_benchmarked_by_eval_benchmarkai_model_costed_by_ai_cost_trackerai_model_flagged_by_hallucination_reportai_model_constrained_by_ai_guardrailai_model_compared_in_model_comparisonai_model_evaluated_through_ai_experimentai_model_trained_on_ai_datasetai_model_produces_ai_traceprompt_template_targets_ai_modelai_experiment_uses_ai_modelmodel_comparison_winner_is_ai_model