8
Just lost $400 on a bad AI training data set
I bought a cheap data set online for a chatbot project last month, thinking it would save me time. The seller said it had 10,000 clean examples, but half were duplicates and many had wrong labels. I spent a week trying to fix it before giving up and buying a better one. Has anyone found a good place to check data set quality before you buy?
3 comments
Log in to join the discussion
Log In3 Comments
ray2101mo ago
Kaggle's public reviews saved me from a bad purchase last year.
6
hugo1531mo ago
Feel your pain, that's the classic "cheap data" trap. Always ask the seller for a small sample file first so you can spot the duplicates yourself. Saves you from learning the hard way.
1
the_viola5d ago
Yeah I once bought a "curated" dataset that was just wikipedia articles with the punctuation stripped out...
1