cllctd — About

Our Mission

Why we built
cllctd.

Every major AI model in production today was trained almost exclusively on data from North America, Western Europe, and East Asia. India — with 1 in 5 people on earth — is a ghost in the training data.

This isn't a minor gap. It means speech models that fail for Hindi and Tamil speakers. It means robotics systems untrained on the environments where billions of workers operate. It means AI that works well for a minority of the world and poorly for everyone else.

cllctd is the marketplace that closes this gap. We connect India's contributors, voice networks, and worker communities with the AI companies that need their data — while ensuring people who generate that data are compensated fairly.

Data that was collected. Not scraped. That's not a tagline. It's the entire business.

Consent First

Every dataset on cllctd carries verified consent documentation. No grey areas. No "publicly available" rationalisations. If a person's data is being used, they agreed to it.

Contributor-First Economics

Contributors are compensated fairly — the only model that builds a lasting supply side.

Niche Before Scale

We go deep on specific verticals — egocentric worker data, regional voice, institutional archives — where margins are high and moats are real.

Dubai Licences. India Supplies.

Our SHAMS structure provides clean IP for global buyers. Our India sourcing provides diversity, volume, and cost advantage no Western marketplace can replicate.

The Model

How cllctd works
structurally.

Supply Side

India Supplies

Contributors, voice networks, worker communities, and institutional archives in India provide the raw material. Consent-verified, documented, submitted through our onboarding portal. We work with vendors to build compliant frameworks where needed.

Platform

cllctd Packages

We annotate, enrich, QA, and format raw vendor submissions into enterprise-grade datasets. Our legal team handles consent verification and rights documentation. Our sales team matches datasets with active buyer demand and manages deal flow end to end.

Demand Side

The World Buys

AI labs, robotics companies, and leading Gulf AI institutions license datasets through our SHAMS entity. Clean IP, zero tax on royalties, and direct access to buyers like leading Gulf AI institutions that need India-origin data and trust UAE legal structures.

Roadmap

Where we are.
Where we're going.

Q1 2026

Platform Launch & Pre-Seed Raise

Landing page, contributor app, and brand identity live. Pre-seed fundraise underway . First contributor and buyer conversations active.

Q2 2026

First Contributor Agreements & Enterprise Deals

Contributor agreements signed. First annotated datasets in catalogue. First enterprise licensing deal closes. Revenue begins. SHAMS entity established.

Q3 2026

Voice Network Expansion

India voice recording networks onboarded across Hindi, Tamil, Telugu, Bengali. Catalogue expands to 20+ listed datasets. Gulf sovereign buyer pipeline activated.

Q4 2026

Consumer App Launch

Mobile app launches in India — iOS and Android. UPI payouts. Task-based data contribution campaigns. First 10,000 active contributors.

2027

Series A & Scale

Series A raise. Arabic-language data expansion for Gulf buyers. Automated annotation pipeline. 50+ enterprise buyers. Recurring subscription model.

Get in Touch

Let's talk.

Whether you're a potential vendor, buyer, investor, or just curious — we reply to everything personally.

📧 General — team@cllctd.ai

📦 Contributors — team@cllctd.ai

💼 Buyers — team@cllctd.ai

💰 Investors — team@cllctd.ai

📍 Dubai, UAE · SHAMS registered

Send us a message

Your name

Email

I am a…

Message

✓ Message sent. We'll be in touch within 2 business days.

Data that wascollected.Not scraped.

Why we builtcllctd.