Coder Social home page Coder Social logo

pihole-antitelemetry's Introduction

pihole-antitelemetry

Research shows Google collects 20x more data from Android than Apple collects from iOS. Block both using these pihole lists.

Project Goals and FAQ

The purpose and goals of this blocklist are outlined in the FAQ

The lists

For a list of domains that should not break anything, use telemetry-domains.txt.

If you want to also block domains that are not well-tested, add telemetry-domains_beta.txt to the list.

Domains that are known to break user-facing services or apps are moved to telemetry-domains_borked.txt for use at your own risk.

These lists are maintained with help from the community. Any efforts to help sort domains into the correct blocklists will help others better block non-consensual telemetry.

Import blocklist

On the Pi-hole admin page, go under Group management > Adlists > Add a new adlist

You can add a new list either by URL or by local path:

( ! ) To apply the changes you need to run pihole -g or update your gravity list online: http://<YOUR-PIHOLE_SERVER>/admin/gravity.php

Existing lists

pihole-antitelemetry's People

Contributors

danifr avatar jk avatar moralcode avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

pihole-antitelemetry's Issues

Blocking gstatic.com

While google does track you with this site, it's mostly used ad a CDN, breaking a lot of sites

mesu.apple.com domain

adding mesu.apple.com might be a good idea. It seems to ping a lot though it also seems to break downloading apps made by apple such as “news”

What other blocklist formats should be supported?

What other blocklist formats are out there that people would like to see supported?

This is primarily a voting issue for people to leave a comment to mention their preferred format and upvote ones they would like to see added.

Create a script to help automate simple changes to the csv domain list

the UPDATING guide in this repo details some quirks and weird workarounds regarding how the CSV should best be edited.

It may be helpful to create an additional python script that allows a domain name and list name to be passed as a parameter so it can automatically update the list of every instance of that domain to the new list.

Related: #9

Domain name formatting

as mentioned in the UPDATING file, there are some domains in the list that represent the same domain but differ by a commonly confused character (like dashes, em dashes, and en dashes). Theres also a few that look like two characters smushed together (I think there's one like that for fi in a couple places). These are likely due to copy-paste errors or from parsing the PDF, however I want to be sure that these different domains have no chance of being valid or in-use domains at all before removing them as this may be a tactic used by google/Apple .etc to evade domain-based blocks using common confusable characters.

For this issue I would like to:

  • look into the validity criteria for a domain name to determine what unicode/funky characters are or are not allowed by the spec
  • replace these characters with the canonically correct version(s) and update the id field for any groups of entries that represent the same domain and are merged with an existing category

This will likely be necessary to have complete before #10 will be able to work properly as that relies on having a consistent domain list

Validation of block lists

I think it would be both interesting and useful to replicate some of the methods used from the linked papers as a means of validating domains in the beta and borked lists to gather some evidence to promote them to the main list.

That said there are other ways to help improve these lists, namely just by adding the beta and/or borked lists to your own pihole installation and report any domains that are in a list that they should not be in (i.e. domains in beta that break apps and services).

Since this repository is mainly intended as a community resource to gather information on these domains, any contributions you can make will ultimately benefit other pihole users and blocklist creators.

find a better way to manage and maintain the spreadsheet

as seen by #4, the domains.csv is a spreadsheet of all the domains that were either part of the original paper or were submitted by users to this repo or on Reddit. Maintaining it in a CSV seems a bit cumbersome, so I'm looking for alternative ways to maintain this list as a community resource, ideally under version control.

Looking for ideas for how to do this. Using a shared, public-comment google doc seems pretty counter to the goals of this repo (and Id prefer an open, or at least minimally harmful-to-users platform anyway). Currently thinking of using a GitHub Wiki for this but maintaining it as a markdown table doesn't seem very portable/usable to other blocklist maintainers.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.