An empirical study of web cookies

Aaron Cahn, Scott Alfeld, Paul Barford, S. Muthukrishnan

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    Abstract

    Web cookies are used widely by publishers and 3rd parties to track users and their behaviors. Despite the ubiquitous use of cookies, there is little prior work on their characteristics such as standard attributes, placement policies, and the knowledge that can be amassed via 3rd party cookies. In this paper, we present an empirical study of web cookie characteristics, placement practices and information transmission. To conduct this study, we implemented a lightweight web crawler that tracks and stores the cookies as it navigates to websites. We use this crawler to collect over 3.2M cookies from two crawls, separated by 18 months, of the top 100K Alexa websites. We report on the general cookie characteristics and add context via a cookie category index and website genre labels. We consider privacy implications by examining specific cookie attributes and placement behavior of 3rd party cookies. We find that 3rd party cookies outnumber 1st party cookies by a factor of two, and we illuminate the connection between domain genres and cookie attributes. We find that less than 1% of the entities that place cookies can aggregate information across 75% of websites. Finally, we consider the issue of information transmission and aggregation by domains via 3rd party cookies. We develop a mathematical framework to quantify user information leakage for a broad class of users, and present findings using real world domains. In particular, we demonstrate the interplay between a domain's footprint across the Internet and the browsing behavior of users, which has a significant impact on information transmission.

    Original language: English (US)
    Title of host publication: 25th International World Wide Web Conference, WWW 2016
    Publisher: International World Wide Web Conferences Steering Committee
    Pages: 891-901
    Number of pages: 11
    ISBN (Electronic): 9781450341431
    DOIs: https://doi.org/10.1145/2872427.2882991
    State: Published - 2016
    Event: 25th International World Wide Web Conference, WWW 2016 - Montreal, Canada
    Duration: Apr 11 2016 → Apr 15 2016

    Publication series

    Name: 25th International World Wide Web Conference, WWW 2016

    Other

    Other: 25th International World Wide Web Conference, WWW 2016
    Country: Canada
    City: Montreal
    Period: 4/11/16 → 4/15/16

    ASJC Scopus subject areas

    • Computer Networks and Communications
    • Software

  • Cite this

    Cahn, A., Alfeld, S., Barford, P., & Muthukrishnan, S. (2016). An empirical study of web cookies. In 25th International World Wide Web Conference, WWW 2016 (pp. 891-901). (25th International World Wide Web Conference, WWW 2016). International World Wide Web Conferences Steering Committee. https://doi.org/10.1145/2872427.2882991