Ian Kennedy
2026
Social Construction of Urban Space: Using LLMs to Identify Neighborhood Boundaries From Craigslist Ads
Adam Visokay | Ruth Bagley | Chris Hess | Ian Kennedy | Kyle Crowder | Rob Voigt | Denis Peskoff
Proceedings of the Seventh Workshop on Natural Language Processing and Computational Social Science
Adam Visokay | Ruth Bagley | Chris Hess | Ian Kennedy | Kyle Crowder | Rob Voigt | Denis Peskoff
Proceedings of the Seventh Workshop on Natural Language Processing and Computational Social Science
Rental listings offer a window into how urban space is socially constructed through language. We analyze Chicago Craigslist rental advertisements from 2018 to 2024 to examine how listing agents characterize neighborhoods, identifying mismatches between institutional boundaries and neighborhood claims. Through manual and large language model annotation, we classify unstructured listings from Craigslist according to their neighborhood. Further geospatial analysis reveals three distinct patterns: properties with conflicting neighborhood designations due to competing spatial definitions, border properties with valid claims to adjacent neighborhoods, and “reputation laundering" where listings claim association with distant, desirable neighborhoods. Through topic modeling, we identify patterns that correlate with spatial positioning: listings further from neighborhood centers emphasize different amenities than centrally-located units. Natural language processing techniques reveal how definitions of urban spaces are contested in ways that traditional methods overlook.
2021
Reconsidering Annotator Disagreement about Racist Language: Noise or Signal?
Savannah Larimore | Ian Kennedy | Breon Haskett | Alina Arseniev-Koehler
Proceedings of the Ninth International Workshop on Natural Language Processing for Social Media
Savannah Larimore | Ian Kennedy | Breon Haskett | Alina Arseniev-Koehler
Proceedings of the Ninth International Workshop on Natural Language Processing for Social Media
An abundance of methodological work aims to detect hateful and racist language in text. However, these tools are hampered by problems like low annotator agreement and remain largely disconnected from theoretical work on race and racism in the social sciences. Using annotations of 5188 tweets from 291 annotators, we investigate how annotator perceptions of racism in tweets vary by annotator racial identity and two text features of the tweets: relevant keywords and latent topics identified through structural topic modeling. We provide a descriptive summary of our data and estimate a series of generalized linear models to determine if annotator racial identity and our 12 latent topics, alone or in combination, explain the way racial sentiment was annotated, net of relevant annotator characteristics and tweet features. Our results show that White and non-White annotators exhibit significant differences in ratings when reading tweets with high prevalence of particular, racially-charged topics. We conclude by suggesting how future methodological work can draw on our results and further incorporate social science theory into analyses.