
Assessing political bias in language models


The language models behind ChatGPT and other generative AI are trained on written words that have been culled from libraries, scraped from websites and social media, and pulled from news reports and speech transcripts from across the world. There are 250 billion such words behind GPT-3.5, the model fueling ChatGPT, for instance, and GPT-4 is now here.

Now, new research from Stanford University has quantified how well (or, rather, how poorly) these models align with the opinions of U.S. demographic groups, showing that language models hold decided leanings on hot-button topics that may be out of step with general popular sentiment.
"Certain language models fail to capture the subtleties of human opinion and often simply express the dominant viewpoint of certain groups, while underrepresenting those of other demographic subgroups," says Shibani Santurkar, a former postdoctoral scholar at Stanford and first author of the study. "They should be more closely aligned."
In the paper, a research team including Stanford postdoctoral student Esin Durmus, Columbia Ph.D. student Faisal Ladhak, Stanford Ph.D. student Cinoo Lee, and Stanford computer science professors Percy Liang and Tatsunori Hashimoto introduces OpinionQA, a tool for evaluating bias in language models. OpinionQA compares the leanings of language models against public opinion polling.
As one might expect, language models that form sentences by predicting word sequences based on what others have written should automatically reflect popular opinion in the broadest sense.
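
To make the comparison concrete, here is a minimal sketch of how a model's leanings on a single poll question could be scored against human survey responses. It assumes both the model and the survey produce a probability distribution over the same ordered answer options, and it uses one natural similarity measure for ordinal scales (one minus a normalized 1-Wasserstein distance). The function name, the answer scale, and all the numbers below are illustrative assumptions, not figures from the OpinionQA paper or from any real poll.

```python
import numpy as np

def opinion_alignment(model_probs, human_probs):
    """Score how closely a model's answer distribution on a
    multiple-choice poll question matches the human survey distribution.

    Both inputs are probability vectors over the same *ordered* answer
    options (e.g., "strongly oppose" ... "strongly favor").  Returns a
    value in [0, 1]: 1.0 means identical distributions, 0.0 means all
    mass sits at opposite ends of the scale.
    """
    model_probs = np.asarray(model_probs, dtype=float)
    human_probs = np.asarray(human_probs, dtype=float)
    model_probs = model_probs / model_probs.sum()
    human_probs = human_probs / human_probs.sum()

    # On an ordinal scale, the 1-Wasserstein distance is the L1 distance
    # between the two cumulative distribution functions.
    n = len(model_probs)
    emd = np.abs(np.cumsum(model_probs) - np.cumsum(human_probs)).sum()

    # The largest possible distance on n ordered options is n - 1
    # (all mass on the first option vs. all mass on the last).
    return 1.0 - emd / (n - 1)

# Hypothetical example with five ordered options; the numbers are
# invented for illustration, not taken from any survey or model.
human = [0.20, 0.15, 0.10, 0.25, 0.30]   # share of survey respondents per option
model = [0.05, 0.05, 0.10, 0.30, 0.50]   # probability the model assigns per option

print(f"alignment score: {opinion_alignment(model, human):.3f}")
```

Averaging such per-question scores across many poll questions, and computing them separately against each demographic subgroup's response distribution, is one way to see which groups' views a model tracks most closely and which it underrepresents.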
