Comet unveils cost intelligence tool for AI code spend visibility
Engineering teams moving fast with Claude Code and Codex hit a hard wall: real spend visibility. If AI tools are now core infrastructure, not side projects, leaders need answers when budgets spike — fast. Comet's new Opik cost intelligence marks a real shift: granular, real-time, per-engineer tracking that surfaces the actual cost of every coding agent event before the invoice drops. Cost reviews after the fact are obsolete.
What is Comet Opik and why does AI coding agent cost tracking matter?
Comet Opik now delivers cost intelligence for Claude Code and Codex — the first tool exposing exactly where your AI coding budget is going, at the level that matters: per engineer, per team, per task. Positioned as an AI observability and evaluation platform, Opik was already monitoring AI model performance and usage. With the introduction of coding agent cost tracking, it closes the gap for engineering leaders responsible for real budgets: not just model output, but the bottom line.
The friction is industry-wide. Engineering tool spend has shifted from the visible (SaaS licenses, infrastructure) to the opaque (AI token consumption billed per-use across sprawling code bases). Until now, leaders couldn't answer even basic questions:
- What does each engineer cost us in Claude Code and Codex API usage this sprint?
- Are we burning expensive tokens maintaining idle or redundant managed code plugins (MCPs) and skills?
- Which teams or feature tracks are driving the majority of spend — and does it align with value shipped?
Without cost tracking for AI coding agents, strategic budget conversations defer to vendor invoices — long after optimization is possible. Opik's release turns the black box transparent.
Takeaway: Having "AI observability" is table stakes; cost tracking closes the expert visibility gap and arms engineering leaders with actionable data.
How does Comet Opik track spending on Claude Code and Codex in real time?
Opik's core win is granularity: it tracks AI coding spend not just as a lump sum, but in real time, broken down by engineer, by team, and by task. As soon as an engineer invokes a Claude Code or Codex-powered workflow — whether generating code, debugging, auto-fixing a PR, or running automations — Opik attributes tokens, cost, and context usage instantly.
Each coding agent event through Claude Code or Codex APIs is measured for:
# Example metrics captured per event
- engineer_id: "alex@company.com"
- team_id: "backend"
- action: "generate_code"
- tokens_used: 2420
- model_id: "claude-code-v4"
- mcp_id: "default-refactor"
- timestamp: 2026-06-10T11:12:33Z
- cost_usd: 0.018
- task_ref: "feature/checkout-fix"Realtime data allows engineering leads to see — before any invoice arrives — how spend is tracking against budgets at every level of the org chart. This supports everything from high-level budget planning to tactical debugging of runaway team spend after a workflow misconfiguration. The visibility is live — not a lagging report.
Integration is API-driven: Opik connects to your Claude Code and Codex usage via native integration, meaning no complex manual reconciliation or batch uploads. Instead, cost intelligence flows continuously, surfaced in dashboards and exportable for finance reviews.

The depth of reporting also enables accountability and forecasting for specific projects or features, not just generic usage buckets.
Takeaway: Comet Opik moves cost understanding from after-the-fact invoice audits to live, per-action attribution — compressing the feedback loop for better budget control.
11 production screens. Auth, DB, Stripe — all wired.
The SaaS Dashboard Kit ships everything already connected. No Vercel config, no Supabase account. Live demo at saas.otf-kit.dev.
What cost optimization strategies does Opik use to reduce AI spend?
Opik doesn't stop at reporting. Its real advantage: it automatically eliminates waste in Claude Code and Codex usage without blocking developers or forcing them to ration access. The optimization happens behind the scenes, so teams keep moving fast.
Key strategies include:
-
Eliminating unused skills and idle MCPs: Opik continuously inspects which managed plugins and code skills are loaded but never invoked. These idle attachments can quietly burn tokens every invocation, especially across large orgs where context configs drift. Opik auto-drops unused components from the active context — zero friction for devs.
-
Fixing misconfigured compaction strategies: AI context size and token compaction settings are a common hidden source of runaway cost. Opik detects when models are running with suboptimal context windows or are holding onto expensive history longer than needed. It applies best-practice compaction, reducing per-call costs.
-
Exposing and optimizing by model and workflow: Many teams default to higher-cost models (or oversized MCPs) for tasks that don’t require them. Opik surfaces where default model selection can be optimized per workflow — with recommendations and (optionally) automatic downgrades when appropriate.
These aren't manual review steps — Opik executes them as part of the monitoring pipeline. The result is verified in the source article: one enterprise customer was able to reduce their AI coding spend by "millions annually" after deploying Opik, without restricting developer AI access or slowing output.
// Sample (pseudo) optimization hook
if (context.hasUnusedMCPs()) {
context.removeUnusedMCPs();
}
if (compaction.isSuboptimal(model, contextSize)) {
compaction.applyBestPractices(model);
}Manual monthly cost reviews typically come too late, and often can't correlate spend to source. Opik's automated optimizations run with every workflow.
Takeaway: Opik delivers continuous, automated cost cutting — not after-the-fact reminders — while keeping developer experience frictionless.
How engineering leaders can use Comet Opik today to control AI coding costs
Getting started with Opik in a real engineering org is direct. Leaders integrate their Claude Code and Codex API credentials with Opik via the platform’s onboarding flow. Once connected, all usage — across automated workflows, direct code generation, agent-triggered debugging, and tool invocations — is tracked from day one.
Practical steps:
- Integrate with existing workflows: Configure Opik using your AI agent API keys and map users and teams to Opik identifiers.
- Access real-time dashboards: Review dashboards segmented by engineer, team, and project. Cost and token burn by feature, MCP, or workflow is visible instantly.
- Identify inefficiencies: Use Opik’s cost intelligence reports to spot unused skills, idle MCPs, or model-selection anomalies.
- Apply (or automate) recommended optimizations: Enable Opik’s auto-optimization engine, or review recommendations for model, context, and plugin configuration. Confirm changes directly in the Opik UI, or allow automated enforcement for non-critical settings.
- Align spend with development priorities: Reference cost per feature delivered or bug fixed. The data shows which investments return value (and which do not).
Every org will surface surprises. Teams often discover a long-misconfigured agent, an MCP loaded for months but never used, or unexpected spikes tied to non-critical workflows. Opik makes these obvious — and fixable.
# Example: enable auto-optimization mode
opik config set auto_optimize trueFinance and dev leads are no longer trading off shipping features against a vague sense of growing AI bills. They can tie spend, outcome, and ownership together.
Takeaway: With Opik, engineering leaders can see, act on, and control AI coding spending as part of their daily workflow — not in the rearview after fiscal closes.
What does the future hold for AI cost visibility in software development?
AI adoption is only accelerating. Toolchains stack Claude Code, Codex, plugins, wrappers, and custom MCPS into fast-evolving, interconnected pipelines. Without integrated, real-time cost intelligence like Opik, complexity swamps budget control. Opaque token burns and invisible misconfigurations threaten to turn AI cost into an enterprise risk — not just a line item.
Future iterations of platforms like Opik will be forced to go deeper: linking AI cost not only to code generation, but to shipped product value, regression frequency, model performance, and even customer-facing feature usage. AI observability will be synonymous with cost alignment — controlling not just model usage, but business outcome.
As Comet’s leadership stated in their announcement, engineering spend on AI tooling is rapidly becoming core infrastructure — and the winners will be those who see, debug, and optimize it as such.
Takeaway: Integrated, actionable cost intelligence is moving from nice-to-have to core requirement as AI agent complexity and spend grow.
Closing
Claude Code and Codex have redefined what’s possible for engineering velocity — and budget complexity. Comet Opik cost intelligence for Claude Code and Codex is the first move toward true spend visibility: real-time, granular, actionable. It lets engineering leaders to scale AI-led development confidently, controlling budgets without throttling innovation or developer experience. If AI tools are now as central as your code repo, then Opik is the cost intelligence layer every serious team will need next.

Ship the product, not the setup.
- 11 production screens — auth, billing, team, analytics, settings
- Real Postgres + Stripe + Better Auth, all wired on day 1
- CLAUDE.md pre-tuned so your agent extends instead of regenerates