Just so you know, Turkic languages span an enormous landmass from Turkey in the west, across the Caucasus, Central Asia, Russia, China, parts of Iran, Afghanistan and Mongolia. This represents one of the largest continuous language family distributions on Earth - spanning roughly 13,000+ kilometers east-west across Eurasia.
https://taginfo.geofabrik.de/asia:mongolia/tags/building=ger
I'm also guessing your model doesn't handle yurts that are on the border of a tile.
Finally, that's a much smaller number than I expected for a country of 3 million.
172k of them? That still seems like quite a lot of yurts; certainly more yurts per capita than anyone else has.
Living away from other people and not next to anything in particular is what I associate with nomads, the heuristic of searching a radius around landmarks doesn't make sense to me. I scrolled around a random remote desert area in Mongolia on Google Maps and found a yurt every couple of minutes.
172.7k yurts. Assuming that these are family residences for the most part, if we take an average occupancy of 4 (which is probably too low - the fertility rate is still quite high there) gives ~691k people living in yurts - approximately 20% of the population of 3.5 million - sounds reasonable.
From my memory: 3 million people, 1.5 living in the capital.
Let's say 1 million are living outside cities.
4 people per yurt.
250,000 yurt.
Add some extra yurts because there will be people having more than one or people living in a house with a yurt in the garden or yurts used as warehouses, etc
300,000 which is almost the double of the count from the ML app.
To start, OSM doesn't use Google Maps imagery for annotation due to licensing concerns. As someone else mentioned, it's rarely clear whether researchers have the right to use Maps imagery let alone download/re-publish it. Part of the reason is that Google sub-licenses imagery from several different providers who are usually extremely protective of IP. So immediately you'd have image/label alignment issues.
Even if you had access to the image that someone used for labeling, it's non-trivial. They might not even have used an image! For example you might walk around and take a GPS reading next to every object and use the keypoints as object centers. Sometimes the annotation quality is low, for example if you want to try using building outlines or roads as segmentation targets for aerial imagery. Or things are simply misaligned. Also since yurts are inherently mobile, you might not even be able to use those labels because objects have moved and there's no guarantee they'll be present in Google Maps.
Finally you'd have issues of omission/commission, because you would have to assume that OSM is complete. That's very sensitive to how active the local community is. Some places are accurate down to the fire hydrant. Where I live, there are plenty of unmapped businesses that have been here for years. Though you could definitely use it to cross-check your own labels + predictions.
The standard for detecting objects on tiles is to discard border predictions and rely on overlap (sliding window) prediction + non max suppression (NMS) to handle duplicates. The overlap is usually something like 1x receptive field of your model, and your "discard" region is a bit larger than your max expected object size.
I'm curious what the topology/architecture of the DL model is like. And are there better ways to approach this problem?
Yes, the syntax is ambiguous, but ambiguously-parseable sentences happen all the time in all languages and we resolve the ambiguity using context clues, which in this case is easy to do.
It's not wrong, but possibly ambiguous, and I'd bet an English teacher would prefer it was phrased differently. In speech, I wouldn't bat an eye at that arrangement. But, if I were to write the headline, it would have been...
"I used machine learning to count all the yurts in Mongolia."
or even just add a comma at the right place:
"I counted all of the yurts in Mongolia, using machine learning"
"One morning I shot an elephant in my pajamas. How he got into my pajamas I'll never know."
Yes, when the phrase is ambiguous, it’s usually more coherent to simply change the sentence order as you’ve done here.
*edit (I mean this sincerely, it made me laugh and I did not see it at first)
For that money, you get a well-isolated easily movable tiny house in a country where you are allowed to settle everywhere (but if you have 2000 sheep with you, you should better discuss the usage of the pastureland with the locals) without paying rent (outside the city).
Choosing a ger for housing is not only about tradition and culture. It is quite rational in that situation.
This is anti-information. People reading this uncritically will come away with completely wrong ideas about the number of yurts in Mongolia, about machine learning algorithms, about data science in general.
Who is harmed by carrying around a mistaken number for this, especially if they notice the 40% confidence?
As to the rest, I read it as an application of tools for an interesting question, not a comprehensive or authoritative how-to. It’s scaled napkin math, and napkin math is very useful.
https://plato.stanford.edu/entries/ethics-manipulation/
But to answer you directly:
- Whoever hires the author for their software engineering or data science expertise in part because of this blog post will pay for substandard work.
- By deceiving their audience as to the accuracy and precision of the demonstrated techniques, the author undermines the audience's ability to make good decisions about when to use or how to reason from the results of machine learning algorithms.
- The author disrespects their audience when they misrepresent themself, their work, and their results.
How should one interpet the "prediction score"?
When used in applications (like this one), the user typically establishes a confidence threshold and then every detection above that threshold is treated as a positive detection, the rest are discarded. The choice can be arbitrary or (sorta) principled.
Counting all of the yurts that happen to be using machine learning is a way more difficult problem.
I know of some government entities in Australia doing similar work, but the effectiveness/quality level of the author's work do make me despair for our government a bit. They're blowing years of Very Expensive Consultant spend and they can't even classify an entire parcel of land correctly, let alone count some little yurt-shaped blobs.
furyg3•7mo ago
Changing climate (desertification) and economic conditions have meant that a lot of people have given up their nomadic lifestyle and moved to cities or their outskirts (mostly Ulaanbaatar). They often are reluctant to do so, it's a big step, and they often hope it is a temporary one.
They set up their yurts not only because of housing shortages, but many are also hesitant to move into apartments or other permanent structures as it's seen as the last step in giving up this nomadic lifestyle. Often they are setting up their yurts next to permanent structures, either because they are living in the 'yard' of relatives or to expand their residences and stay connected to their culture.
You can see examples of this in the first images.
Cthulhu_•7mo ago
AlotOfReading•7mo ago
There's a fairly large domestic tourism industry catering to urban city-dwellers who want to go live in a nice ger for a couple weeks to feel connected to their history.
qq66•7mo ago
> When ineffective policy results in a large chunk of the populace generationally living in yurts on the outskirts of urban areas, it’s clear that there is failure.
That's not at all clear.
aaron695•7mo ago
LLMs agree with OP. It's a failure, with important culture.
Steelmanning it, it's better than a corrugated metal shanty town. Although they would die in the cold.
The rich in the gers burn coal, the poor plastic. There is no water or sewerage.
It's one of the most polluted capitals in the world - https://www.unicef.org/mongolia/environment-air-pollution#:~...
Ulaanbaatar - https://geohack.toolforge.org/geohack.php?params=47_56_7_N_1...
ty6853•7mo ago
TimorousBestie•7mo ago
umanwizard•7mo ago
mlinhares•7mo ago
potato3732842•7mo ago
(both groups are trash, IMO)
kragen•7mo ago
potato3732842•7mo ago
alwa•7mo ago
Breza•7mo ago
Prompt: "Mongolians still live in gers even in 2025... WTF?! That seems like a total policy failure. Help me understand why it's not just a dumb culture being dumb?"
Beginning of response: "The sight of traditional gers, or yurts, persisting as common dwellings in Mongolia, even within the cityscapes of 2025, is not an indication of a 'dumb culture' or a simple 'policy failure.' Rather, it is a complex tapestry woven from deep-seated cultural identity, stark economic realities, and the practicalities of a unique way of life. To understand the prevalence of the ger is to understand the soul of Mongolia itself."
It goes on like that for a few more paragraphs that are, to be honest, quite convincing.
orbital-decay•7mo ago
sfn42•7mo ago
datameta•7mo ago
tough•7mo ago
helpfulclippy•7mo ago
throwup238•7mo ago
codesnik•7mo ago
https://en.wikipedia.org/wiki/Toshhovli_Palace
https://en.wikipedia.org/wiki/Toshhovli_Palace#/media/File:K... that circular spot.