Finding Sites with Trackers: Law & Finance Firms
Finding 100s of Trackers Across 100K+ Sites: For Research and Litigation
Are you looking to find all of the sites with a specific tracker or trackers on them? This sounds like it might be an easy problem, but the scale can be daunting.
Going through websites manually is time-consuming, the HTML source rarely shows the full extent of tracking, network traffic is often overlooked, and establishing a chain of custody for litigation is difficult.
If you’re looking for a specific tracker on large sites, we can help you build this list. We track over 100K of the largest commercial websites and extract all of the trackers on them. From that data, we can give you a list of every site where the tracker appears.
- Shorten research time from months to days or weeks
- Gain insight into 3rd-party data flows you didn’t previously know about
- Strengthen your evidentiary basis before filing suit or making investment decisions
When Is Knowing Trackers Useful?
Knowing which sites run which trackers is a niche need, but it is very useful in certain industries. In the legal industry, it can be used to find companies that are breaking the law by sharing personal information with 3rd parties. This is the main use case we work on with our clients for this data.
If you’re a law firm looking to identify companies that are illegally sharing information with 3rd parties as potential class action targets, we’d be happy to help.
This data can also be used to gauge how quickly these tracking providers are expanding, which is useful in a few niche cases when evaluating tracking companies for investment.
What Trackers Can Be Tracked
We can find almost any tracker across our scanned list of sites. This includes the more common trackers like the Meta/Facebook pixel, TikTok pixel, LinkedIn pixel, and Google Analytics, as well as the many advertising pixels found on larger sites. The full list runs into the hundreds of tracking domains, and we can scan for all of them.
In other words, if you came to us and asked for a list of sites with a tracker from the xxxxxx.com domain on them, we could most likely provide it.
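To make the idea of "a list of sites with a given tracker domain" concrete, here is a minimal sketch in Python. It assumes you already have, for each site, the list of request URLs observed while loading it. The function names, the data layout, and the `.example` domains are all hypothetical and are not a description of our actual pipeline.

```python
from urllib.parse import urlsplit


def host_matches(host: str, tracker_domain: str) -> bool:
    """True if host is the tracker domain itself or one of its subdomains."""
    host = host.lower().rstrip(".")
    tracker_domain = tracker_domain.lower()
    return host == tracker_domain or host.endswith("." + tracker_domain)


def sites_with_tracker(requests_by_site: dict[str, list[str]], tracker_domain: str) -> list[str]:
    """Return the sites whose observed requests include the tracker domain."""
    matches = []
    for site, urls in requests_by_site.items():
        # urlsplit(...).hostname is None for malformed URLs, so fall back to "".
        hosts = (urlsplit(url).hostname or "" for url in urls)
        if any(host_matches(host, tracker_domain) for host in hosts):
            matches.append(site)
    return sorted(matches)


# Hypothetical observations: site -> request URLs seen during one page load.
observed = {
    "shop.example": ["https://shop.example/", "https://px.tracker.example/collect?id=1"],
    "news.example": ["https://news.example/app.js", "https://nottracker.example/x.js"],
}
print(sites_with_tracker(observed, "tracker.example"))
```

Matching on the full domain or a dot-prefixed suffix avoids false positives such as `nottracker.example`, which a plain substring check would wrongly count.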
Collecting Tracker Data
Years of building web crawlers for our clients have given us time to develop some fairly unique crawling technology. We’ve tailored it to examine all of the network traffic on the homepages of 100K+ sites, as well as custom lists of sites in sensitive industries. From this, we can extract what data a site sends to 3rd parties when someone visits the homepage (or other pages).
In the past, we’ve helped our law firm clients successfully identify, document, and litigate dozens of cases (probably close to 100). While we can’t get into specifics about our current clients, we have extensive experience turning custom legal theories into actionable lists of target websites (or emails, or mobile apps) that fit the theory.
Some More Web Crawler Specifics
We use a fairly involved tech stack to determine which sites use which trackers. You can’t just download the HTML of the webpage; you have to actually run a browser and control it remotely. And to make sure you’re collecting everything, you need to save all of the network data.
This lets you distinguish between a tracker that is simply loading on the page and one that is actually running and potentially sharing data. It’s important to collect everything, a lesson we’ve learned through our work with clients.
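As a rough illustration of the "loading vs. actually sending data" distinction, the sketch below reads saved network data in HAR form (already parsed into a Python dict) and splits requests to a tracker domain into two groups. The heuristic used here, treating any request with query parameters or a request body as "sent data", is an assumption made for illustration; a real review would inspect what the parameters actually contain. The function name and the `.example` domains are hypothetical.

```python
from urllib.parse import urlsplit


def classify_tracker_requests(har: dict, tracker_domain: str) -> dict[str, list[str]]:
    """Split requests to a tracker domain into 'loaded_only' and 'sent_data'."""
    result = {"loaded_only": [], "sent_data": []}
    for entry in har.get("log", {}).get("entries", []):
        request = entry.get("request", {})
        url = request.get("url", "")
        host = (urlsplit(url).hostname or "").lower()
        if host != tracker_domain and not host.endswith("." + tracker_domain):
            continue  # not a request to the tracker we care about
        # Assumed heuristic: query parameters or a request body mean data left the page.
        has_query = bool(request.get("queryString"))
        has_body = bool(request.get("postData", {}).get("text"))
        key = "sent_data" if (has_query or has_body) else "loaded_only"
        result[key].append(url)
    return result


# Hypothetical, hand-written HAR fragment for demonstration only.
sample_har = {"log": {"entries": [
    {"request": {"method": "GET", "url": "https://cdn.tracker.example/pixel.js",
                 "queryString": []}},
    {"request": {"method": "POST", "url": "https://px.tracker.example/collect",
                 "queryString": [],
                 "postData": {"mimeType": "application/json",
                              "text": "{\"page\": \"/checkout\"}"}}},
    {"request": {"method": "GET", "url": "https://shop.example/", "queryString": []}},
]}}
print(classify_tracker_requests(sample_har, "tracker.example"))
```

Script loads often carry query parameters too (an account ID, for example), so this heuristic will overcount; it is a first-pass filter, not a finding.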
What You’ll Receive: List and Documentation
If you decide to work with us, we will work with you to identify what information you need and in what formats. If you simply need numbers (e.g. tracker xyz is on ### sites), we’d be happy to provide you with a summary, as well as a spreadsheet of the identified websites.
If you need documentation that the tracker is active on the site and that it is sharing specific information with a 3rd party, we can provide you with screenshots, the HAR file from our testing that shows all of the network traffic, and video if required. In essence, we will do our best to get you everything you need not only to document that the site is using the tracker, but also to show the issue on demand later, even if the site removes the tracker.
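One common way to show later that captured files have not changed is to record a cryptographic hash of each one at collection time. The sketch below is a generic illustration of that idea, not a description of our own process; the function names and the manifest fields are hypothetical.

```python
import hashlib
from datetime import datetime, timezone
from pathlib import Path


def sha256_of_file(path: Path) -> str:
    """Hash a file in chunks so large HAR or video files need not fit in memory."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(paths: list[Path]) -> list[dict]:
    """Record name, size, SHA-256, and a UTC timestamp for each evidence file."""
    recorded_at = datetime.now(timezone.utc).isoformat()
    return [
        {
            "file": path.name,
            "bytes": path.stat().st_size,
            "sha256": sha256_of_file(path),
            "recorded_at_utc": recorded_at,
        }
        for path in paths
    ]
```

Calling `build_manifest([Path("capture.har"), Path("homepage.png")])` (hypothetical file names) returns one record per file; re-hashing a file later and comparing against the stored value shows whether it has been altered.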
Let’s Get Started
Looking for a list of sites with a specific tracker? Let us know below and we’ll be in touch!