I’m a grad student and need to cover some tuition this semester. I’ve
been scraping freelance job boards for a while and turned it into
a dataset.
Data:
- 134,861 unique postings (Apr–Nov 2025)
- Fields: description, hourly/fixed budget, client country, tech tags
- Format: SQLite + CSV; scraped with Python (Selenium/Scrapy) with basic
dedup/normalization (currencies, month-level timestamps, no company names/
URLs)
Sample insight: In this scrape, CV gigs pay more than LLM ones: median fixed
budgets ≈ $3k vs ≈ $2k (Apr–Nov 2025; n≈270 CV, n≈18k NLP).
Free sample (50 rows): https://docs.google.com/spreadsheets/
d/1wvvX8JeOIEO2rYeuMf99LEBN4bmVoZ3ZQqHY_clvCvc/edit?gid=0#gid=0
Full dataset: https://noctscraper.gumroad.com/l/freelance-ai-dataset
Happy to answer questions about the schema or run a quick aggregate if
you’re curious.