Would you like to see your presentation here, made available to a global audience of researchers?
Add your own presentation or have us affordably record your next conference.
Unlike traditional search engines that present ranked lists of webpages, generative search engines rely solely on in-line citations as the key gateway to real-world information exposure, making it crucial to examine whether LLM-based citation generation introduces bias—particularly for politically sensitive queries. In this work, we introduce AllSides-2024, a newly constructed dataset comprising the latest real-world news articles labeled with left- or right-leaning stances for investigating potential political bias in LLM-generated citations. Through systematic evaluations, we find that LLMs exhibit a consistent tendency to cite left-leaning sources at notably higher rates compared to traditional retrieval systems. Controlled experiments further reveal that this bias arises from a preference for media outlets identified as left-leaning, rather than for left-oriented content itself. Meanwhile, our findings show that while LLMs struggle to infer political bias from news content alone, they can almost perfectly recognize the political orientation of media outlets based on their names. These insights highlight the risk that, in the era of generative search engines, information exposure may be disproportionately shaped by specific media outlets, further shaping public perception and decision-making.