The purpose of data mining is to discriminate …
Certain kinds of discrimination are unethical, and illegal
But it depends on the context
Anonymization is harder than you think
When Massachusetts released medical records summarizing every state employee’s hospital record in the mid‐1990s, the governor gave a public assurance that it had been anonymized by removing all identifying information such as name, address, and social security number.
He was surprised to receive his own health records (which included diagnoses and prescriptions) in the mail.
Using publicly available records:
Netflix movie database: 100 million records of movie ratings (1–5)
In 2006, a text file was released on the web containing 20,000,000 search engine queries made by 650,000 users over a 3-month period, intended for research purposes. The file had been anonymized by replacing user names with random numbers, one per user. However, some of the queries contained clues to the user's identity. The New York Times was able to locate an individual from these supposedly anonymized search records by cross referencing them with phonebook listings. Look up this renowned example of reidentification and read about it. What is the name of the user identified by the New York Times?
Stanford Researchers: It Is Trivially Easy to Match Metadata to Real People
Every data that is device scoped such as a mac address, browser fingerprint is not privacy proof as they enables tracking
You can control your privacy by google at:
Pgp (Pretty Good Privacy) is a cryptographic protocol aimed to increase privacy in the digital world.