What’s in a name: A study of names, gender inference, and gender behavior in facebook

Cong Tang, Keith Ross, Nitesh Saxena, Ruichuan Chen

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Abstract

    In this paper, by crawling Facebook public profile pages of a large and diverse user population in New York City, we create a comprehensive and contemporary first name list, in which each name is annotated with a popularity estimate and a gender probability. First, we use the name list as part of a novel and powerful technique for inferring Facebook users’ gender. Our name-centric approach to gender prediction partitions the users into two groups, A and B, and is able to accurately predict genders for users belonging to A. Applying our methodology to NYC users in Facebook, we are able to achieve an accuracy of 95.2% for group A consisting of 95.1% of the NYC users. This is a significant improvement over recent results of gender prediction [14], which achieved a maximum accuracy of 77.2% based on users’ group affiliations. Second, having inferred the gender of most users in our Facebook dataset, we learn several interesting gender characteristics and analyze how males and females behave in Facebook. We find, for example, that females and males exhibit contrasting behaviors while hiding their attributes, such as gender, age, and sexual preference, and that females are more conscious about their online privacy on Facebook.

    Original languageEnglish (US)
    Title of host publicationDatabase Systems for Adanced Applications - 16th International Conference, DASFAA 2011, International Workshops
    Subtitle of host publicationGDB, SIM3, FlashDB, SNSMW, DaMEN, DQIS, Proceedings
    EditorsJianliang Xu, Ge Yu, Shuigeng Zhou, Rainer Unland
    PublisherSpringer Verlag
    Pages344-356
    Number of pages13
    ISBN (Print)9783642202438
    DOIs
    StatePublished - 2011
    Event16th International Conference on Database Systems for Advanced Applications, DASFAA 2011 - Hong Kong, China
    Duration: Apr 22 2011Apr 25 2011

    Publication series

    NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Volume6637 LNCS
    ISSN (Print)0302-9743
    ISSN (Electronic)1611-3349

    Other

    Other16th International Conference on Database Systems for Advanced Applications, DASFAA 2011
    Country/TerritoryChina
    CityHong Kong
    Period4/22/114/25/11

    ASJC Scopus subject areas

    • Theoretical Computer Science
    • General Computer Science

    Fingerprint

    Dive into the research topics of 'What’s in a name: A study of names, gender inference, and gender behavior in facebook'. Together they form a unique fingerprint.

    Cite this