How Google Collects Data About You and what you need to know about Internet (Part 1)

How Google Collects Data About You and what you need to know about Internet (Part 1)
Google, perhaps more than any other company has realized that information is power. Information about the Internet, information about innumerable trends, and information about its users—YOU.
So how much does Google know about you and your online habits? It’s only when you sit down and actually start listing all of the various Google services you use on a regular basis that you begin to realize how much information you’re handing over to Google.
Let’s have a look at how Google is gathering information from you, and about you.
Google’s Information-Gathering Channels
The stated mission of Google is “to organize the world’s information and make it universally accessible and useful,” and it’s making good on this promise. However, Google is gathering even more information than most of us realize.
Searches (web, images, news, blogs, etc.) – Google is, as you all know, the most popular search engine in the world, with a market share of around 92% as of February 2019 according to StatCounter (for example, 88% of searches in the U.S. are made on Google). Google tracks all searches, and now with search becoming more and more personalized, this information is bound to grow increasingly detailed and user-specific.
Clicks on search results – Not only does Google get information on what we search for, but it also gets to find out which search results we click on. Additionally, it knows the relation between clicks and impressions (click-through-rate, or CTR) for all search results, and may use CTR for search rankings. The result is that often-clicked websites get higher in the search results.
Web crawling – Googlebot, the Google web crawler, is a busy bee, continuously reading and indexing billions of web pages.
Website analytics – Google Analytics is by far the most popular website analytics package out there. Since it’s free and still supports a number of advanced features, it’s used by a large percentage of the world’s websites.
Ad serving – Google Ads (known as Google AdWords until July 2018) and Google AdSense are cornerstones of Google’s financial success, but they also provide Google with much valuable data. Which ads are people clicking on, which keywords are advertisers bidding on, and which ones are worth the most? All of this is useful information.
Email – Gmail is one of the three largest email services in the world, together with competing options from Microsoft (Outlook) and Yahoo. Interestingly, email content, both sent and received, is parsed and analyzed not just by Google itself, but also by third-party apps.
G Suite (Docs, Sheets, Slides, Calendar, Drive, etc.) – The Google office suite has many users and is, of course, a valuable data source to Google.
Google Public DNS – The Google DNS service doesn’t just help people get fast DNS lookups; it helps Google too, because it can get a ton of statistics from this, such as which websites people access.
Google Chrome – What is your web browsing behavior? What sites do you visit? Google has access to everything related to web browsing through the massive Chrome’s user base. According to Statista, as of August 2018, several versions of the Chrome Browser (including mobile and desktop) had around 55% market share globally.
Android OS – The world’s most used mobile OS is an immense source of data that Google has access to.
Google Pixel – Google launched its line of Android-based smartphones back in 2016. The most recent model is the highly-praised Google Pixel 3 released in October 2018.
Google Assistant and Google Home – Google also offers an AI-based virtual assistant available on phones and smart home devices, such as the Google Home speaker.
Chrome OS – While not nearly as successful as the Android OS, Chrome OS-based laptops, aka Chromebooks, are available in many corners of the globe.
Google Finance – Aside from the finance data itself, what users search for and use on Google Finance is sure to be valuable data to Google.
YouTube – The world’s largest and most popular video site by far is, as you know, owned by Google. It gives Google a huge amount of information about its users’ viewing habits.
Google Translate – This helps Google perfect its natural language parsing and translation.
Google Books – Not huge for now, but it has the potential to help Google figure out what people are reading and want to read.
Google Flights – Launched in 2011, Google Flights is a search service where users can find and book airline tickets through third-party providers.
Google Maps and Google Earth – What parts of the world are you interested in?
Your contact network – Your contacts in Google Meet, Gmail, YouTube, etc., make up an intricate network of users. And if those contacts also use Google, the network can be mapped even further. We don’t know if Google does this, but the data is there for the taking.
And the list could go on since there are even more Google products out there, but we think by now you’ve gotten the gist of it.
Much of this data is anonymized, but not always right away. Advertising data in server logs is kept for nine months, and cookies (for services that use them) aren’t anonymized until after 18 months. Even after that, the sheer amount of generic user data Google has on its hands is a huge competitive advantage against most other companies—a veritable gold mine.