Ask HN: How to Evaluate IP Dataset?

1•punkpeye•1h ago
I've been researching cost-effective ways to enrich millions of IPs per day (I only need country, city, and ASN), and I came across ip-api.com.

The service is mentioned in a few random Reddit threads as a cost-effective option, but I cannot find any discussion of how accurate its data is.

How would one even go about verifying it?

For context, I am currently paying another provider a few thousand per month for IP data. I am not really using this data for anything other than having enough context to troubleshoot customer support and fraud cases. This service promises unlimited IP data for USD 16/month, which sounds too good to be true, but there are many positive reviews across the Internet, so I am trying to evaluate whether it is worth adopting.

Comments

reincoder•1h ago
I work for IPinfo. We provide a free country and ASN database on a free tier with an unlimited amount of requests. You can download the entire database or use the API services. For country and ASN, it is free.

However, we do not offer city-level data for free.

> How would one even go about verifying it?

We believe we are the most accurate IP data provider out there, but you should come to that conclusion yourself.

I can tell you why our data is super accurate compared to the rest of the industry. The industry as a whole relies on self-reported location information published by ASNs and ISPs, called a "geofeed". The issue with geofeeds is that IP geolocation providers tend not to verify their accuracy. Many providers just aggregate these public records and repeat whatever the ISPs and ASNs tell them. This is quite a bad practice.

So we built a network of distributed servers (currently 1360 servers across 160 countries) that run ping, traceroute and other internet measurements to infer where IP addresses are located. This means that when you ask how you can know we are accurate, we can share our active measurement data as the evidence.

Now comes the question of how you assess accuracy yourself.

First, if you have access to a large pool of IP addresses with known locations, you can run comparisons across different vendors. Building such a pool requires GPS-backed devices to establish where the IP addresses really are.
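
A minimal sketch of that comparison, assuming you already hold ground-truth coordinates for a sample of IPs and have collected each vendor's reported coordinates yourself. The function names, the dict layout and the 50 km threshold are hypothetical choices for illustration; nothing here calls a real provider API.

```python
import math
from statistics import median

EARTH_RADIUS_KM = 6371.0  # mean Earth radius


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance between two (lat, lon) points in kilometres."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    # Clamp to 1.0 to guard against floating-point overshoot near antipodes.
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(min(1.0, a)))


def score_vendor(ground_truth, vendor_answers, within_km=50.0):
    """Summarise one vendor's error against known locations.

    Both arguments map an IP string to a (lat, lon) tuple.
    IPs the vendor did not answer are counted as missing.
    """
    errors = [
        haversine_km(*ground_truth[ip], *vendor_answers[ip])
        for ip in ground_truth
        if ip in vendor_answers
    ]
    if not errors:
        return None
    return {
        "answered": len(errors),
        "missing": len(ground_truth) - len(errors),
        "median_error_km": median(errors),
        "share_within": sum(e <= within_km for e in errors) / len(errors),
    }
```

Running `score_vendor` once per vendor over the same ground-truth sample gives you comparable numbers (median error, share of answers within the threshold) instead of relying on each vendor's own accuracy claims.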

If you do not have a large pool of IPs with known locations, you can take a sample of IP addresses and check them yourself across multiple vendors. You can then use a tool like ping.sx or our own tool ipinfo.io/probenet/live to see evidence of where these IP addresses are located based on latency.

Do not bet on consensus among IP geolocation providers; run your own tests.

Our data has been evaluated in peer-reviewed academic research. You can take a look at that as well, if you want.
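
To illustrate how latency can serve as evidence, here is a rough sketch of a feasibility check. It assumes signals travel through fiber at about two thirds of the speed of light (roughly 200 km per millisecond), so a round-trip time from a probe at a known location puts an upper bound on how far away the target can be. The function names and the example coordinates are hypothetical, and this is not a description of how any particular tool or vendor does it.

```python
import math

# Assumption: ~2/3 of the speed of light in vacuum, i.e. about 200 km per ms.
FIBER_KM_PER_MS = 200.0
EARTH_RADIUS_KM = 6371.0


def great_circle_km(a, b):
    """Haversine distance in km between two (lat, lon) tuples."""
    phi1, phi2 = math.radians(a[0]), math.radians(b[0])
    dphi = phi2 - phi1
    dlmb = math.radians(b[1] - a[1])
    h = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(min(1.0, h)))


def max_distance_km(rtt_ms):
    """Upper bound on one-way distance implied by a round-trip time."""
    return (rtt_ms / 2.0) * FIBER_KM_PER_MS


def claim_is_feasible(probe, claimed, rtt_ms):
    """True if the claimed location is physically reachable within the RTT."""
    return great_circle_km(probe, claimed) <= max_distance_km(rtt_ms)


# Example: a probe in Frankfurt measures an 8 ms RTT to the target IP.
frankfurt = (50.1109, 8.6821)
amsterdam = (52.3676, 4.9041)
new_york = (40.7128, -74.0060)
print(claim_is_feasible(frankfurt, amsterdam, 8.0))
print(claim_is_feasible(frankfurt, new_york, 8.0))
```

Note the check only rules locations out: a low RTT proves the target is nearby, but a high RTT does not prove it is far away, since routing detours and queueing also add delay.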

> I am not really using this data for anything other than have enough data to troubleshoot customer support/fraud.

Now, I will be honest... you should not pay us anything. From the way you have described your issue, it seems the free services we already offer should satisfy your needs.

Do you really need large scale IP address enrichment of all the IP addresses that visit your website? If yes, then for the first layer use our free data that provides ASN and country information.

Then, when you need troubleshooting with your customers, you can look up those individual IP addresses for free on our website, where we provide all our data for free access.

---

Let me know if you need any help, always happy to answer questions.

punkpeye•34m ago
So many of my open questions answered in one answer. Thank you.

A follow-up based on this new information: if a geofeed reports the wrong location and your method detects a different one, what do I see as the consumer of your API? I am assuming the inferred data, but that also feels counter-intuitive (since the data does not align with what the ASN/ISP is reporting).

How often does your active measurement data disagree with geofeed data?

How do you handle mobile/cellular IPs?

> Do you really need large scale IP address enrichment of all the IP addresses that visit your website? If yes, then for the first layer use our free data that provides ASN and country information.

If I am troubleshooting a support case that is days, weeks, or months old, wouldn't enriching the IP at that later date potentially give me different data than what applied when the requests were made? My understanding is that IPs get reassigned.

How frequently do IP-to-location mappings change in practice?

Do you offer historical IP data snapshots?
