
June 3, 2026
A vendor quotes $0.05 per label. Your project needs 1 million labels. You budget $50,000. Eight months later you've spent $187,000 and the project is 60% complete. This isn't because the vendor lied — it's because the per-label sticker covers about 15% of the actual cost stack.
Here's the full TCO model, with worked numbers across three delivery options.
Raw labeling rate — what the vendor quotes. QA review time — usually a second labeler at 30-50% of the original rate. Edge case escalations — the 5% of samples that take 10x as long. Schema iteration — when your taxonomy evolves mid-project. Rework — when your model evaluation shows class confusion and you re-label affected samples. Project management — your PM's time managing the vendor relationship. Tool license — the per-seat annotation platform fee.
Most teams budget only the first item. The seventh component matters because labeling without a platform creates downstream costs in dataset versioning, audit trail, and integration with training.
Scenario: 1 million product photos, e-commerce attribute tagging, 12 classes.
Manual labeling (offshore vendor at $0.05/label headline): Raw labeling $50K. QA at 40% = $20K. Edge cases at 5% × 10x = $25K. Schema iteration (2 cycles) = $8K. Rework at 8% = $6K. PM time $24K (6 months at 25% load). Tool license $9K. Total: $142K. Effective per-label: $0.142.
AI-assisted self-serve (foundation model pre-labels + human verification): Pre-label compute $14K. Verification labor $32K (3x faster than from scratch). QA $10K. Edge cases $18K. Schema iteration $4K. Rework $4K. PM time $14K. Tool license $11K. Total: $107K. Effective per-label: $0.107.
Managed service (vendor delivers labeled dataset): Quoted $145K all-in. PM time $9K (lighter touch). Tool license bundled. Total: $154K. Effective per-label: $0.154.
Managed service looks expensive at the sticker but is the cleanest at 1M scale because all coordination cost is absorbed by the vendor. Manual offshore looks cheap at the sticker but the PM time required to manage a vendor at this scale is structurally expensive. AI-assisted self-serve wins on cost if and only if you have the engineering capacity to manage the model + workforce loop.
Below 50K labels, manual is fine. Between 50K-300K, AI-assisted self-serve wins on cost but assumes you have ML engineering. Above 300K, integrated managed service usually wins on total time and risk, especially when audit and compliance overhead is loaded in.
The hidden cost is your team's attention. PMs running offshore vendor relationships at 1M-label scale lose 25-30% of their bandwidth for the duration of the project. Integrated platforms compress this — Intellabel's Workforce Manager + managed labeling service handles the vendor coordination inside the same workspace you use for dataset versioning and training. The dollar savings are marginal; the attention savings are not.