This is an interesting question that I sometimes find myself thinking about. I’ll start with my answer, I actually don’t know, but I can show you that all of them can be!
Since my wife is doing a Ph.D. in biology she reads and analyzes a lot of data and also decides how to present it (or criticizes how others present theirs). So it got me thinking about presenting the most common database in the world. Note that I’m not even talking about retrieving the data itself and its correctness, only the way we can analyze it.
When thinking about the question “which is the most common database?” I can compare it to asking “which is the most common language in the world?”
With languages I can present the data in many different ways, and it all depends on how we declare “common”. For example, some people may say that “Chinese” (or Mandarin) is the most common language in the world, and this is correct, assuming you count the number of people that speak that language as native speakers. However, I can define the most common language as “the language that is spoken in the largest number of countries in the world (as a first-language)”. This is a little bit harder to find on the internet, but it should be Spanish.
Another way to define “common language” is “how many people in the world can speak this language (generally, not as native speakers), and this is probably English (though I couldn’t find a reliable source for that either). And these are not the only ways I can think of to define a “common language”.
With databases it’s even more complex, and it’s more difficult to get the correct data. When someone asks me about the “most common” database, I guess I can give them several databases as an answer, and I won’t be wrong. I guess this is exactly what the marketing departments of these companies do.
The most “common database” can be:
- The one with the largest number of installations
- The one with the largest number of installations when only counting production
- The one with the largest number of installations, but only in certain organizations (enterprises for example)
- The one that holds the largest total amount of data (how you measure data will be the key here, raw data only? Do you count duplicates?)
- The one the largest number of organizations use (count only the number of organizations, not the databases)
And these are only 5 options, I can think of some more very easily…
So I guess we can find definitions for the most “common database” that will suit SQL Server, Oracle, MySQL, PostgreSQL and more as answers.
Another problem in our case, as I mentioned above, is that we don’t have all the data. The companies don’t provide this information, and even if they did, there is no way to verify it. With the free databases it’s even more difficult, as we don’t actually know when someone installs it and uses it.
So, the next time someone tells you they are the most common database in the world, ask yourself if they tell you how they measure it.
Most common would probably be SQLite
But you didn’t answer to the basic question in my post, how do you define common?