cat > Bourne.identity #! /bin/bash echo "$*" ^D
The contrived scenario is just a starting point. One can make more and more sophisticated ecological statistics models about the situation.
Regarding why estimate at all knowing they can be wrong ? Estimates are very useful for planning. Sophisticated models would also yield probabilities of over and underestimated, these combined with cost of over and underestimation errors are very useful for decision making.
See the German tank problem. Turns out the allied forces overestimated the number to f tanks left, still helped in planning.
Consider a contrived scenario where an opaque jar contains N distinguishable marbles. You take one out and note it's type and put it back in. You repeat this n times. If k out of n are unique it conveys information about N.
If, for example, k=1 then N is likely small. On the other hand if k=n then N is likely large.
The most computer-sciencey way is to look at n at which you get a repeat, ah! a hash collision.
One can make these ideas more quantitative under assumptions about the numbers of each types of marbles.
The math of hashing, birthday paradox, coupon collection and hyperloglog are good places to start.
Then there are other ways. Two of you count the number of typos in a tedious text. One says N the other says n and out of them only k are common. From this you can estimate the likely number of typos in the text.
This project is an enhanced reader for Ycombinator Hacker News: https://news.ycombinator.com/.
The interface also allow to comment, post and interact with the original HN platform. Credentials are stored locally and are never sent to any server, you can check the source code here: https://github.com/GabrielePicco/hacker-news-rich.
For suggestions and features requests you can write me here: gabrielepicco.github.io