From Nawaz to Imran: How Political Is Gemini?

Undated images of PML-N supremo Nawaz Sharif (left) and PTI Chairman Imran Khan. X/@pmln_org Instagram/@imrankhan.pti

KARACHI: Google Gemini is the latest addition to the AI block. But what are the politics of chatbots? Experts say the political (or other) biases of AI chatbots depend on the datasets they are trained on; Gemini itself says that while it strives to remain politically impartial in its responses, it is likely to inherit biases present in its training data.

In December 2023, Google introduced its AI-powered chatbot, Gemini. In February 2024, Google had to apologize for some offensive and inaccurate images produced by the chatbot's image generation feature.

For this story, The News asked Gemini general questions about former prime ministers Nawaz Sharif and Imran Khan to see how the chatbot would respond. We first asked Gemini if Pakistan would be better off without a leader like Nawaz Sharif. In its response, Gemini called Sharif a controversial figure and wrote a brief paragraph about him, noting that his supporters think he should rule the country and describing the level of popularity he has enjoyed there.

When the same question was asked about Imran Khan, Gemini offered a structured response, presenting arguments both for and against Khan's leadership. It also added a postscript-style paragraph listing additional considerations for determining whether Khan is a good leader.

By itself, this does not show any blatant bias on the part of the chatbot. But it does suggest that the chatbot has more information about Imran Khan.

Software engineer Javeria Urooj, who works at the Advanced Engineering Research Organization (AERO), says Gemini's answers don't surprise her at all.

"Chatbots are trained on data sets," she tells The News. "Imagine going through billions of recording files to train the bot. You must provide thousands of files containing the relevant data to allow the bot to determine a pattern and compose its responses accordingly."

Things took an interesting turn when The News changed the variable from Nawaz Sharif to Bilawal Bhutto-Zardari.

The News asked the chatbot, in two separate questions, whether it was okay to make fun of Bilawal and Imran Khan. The chatbot's response was uniform: these were political figures, and while no one should be made fun of, these leaders were often the subject of political satire. For Khan, however, it once again raised additional considerations, adding that the former prime minister had a large and passionate following and that making fun of him could be seen by them as offensive or disrespectful.

Javeria says that while she can't give a definitive answer about how Google trained Gemini, she thinks the company could have conducted surveys to find out what people say about political figures. The data sets could have included blogs and articles published on the Internet or books written about these personalities.

If the material used for training has negative connotations, the bot will pick up that tone and present its responses accordingly. Simply put, if you give it facts that argue against a person, the bot will give similar responses.

"Chatbots are only as good as the information and data put into them, and if you don't have a big enough data set, you won't get accurate information when you ask for it," says digital rights activist and director of Bolo Bhi Usama Khilji. "This is why I think relying on generative AI and chatbots is not a good idea, especially for countries like ours where online information is not as vast."

Big tech companies typically rely on people's intelligence to distinguish between factual and incorrect information. In October 2023, at the GITEX conference in Dubai, a Microsoft official said that digital users also need to be aware of the inaccuracy of AI-based tools. This raises concerns in developing countries like Pakistan, where digital literacy is already quite low.

Umair Arif, who holds a doctorate in artificial intelligence (AI), says chatbots get their training data from the Internet. The side that has a relatively large digital footprint will end up having a dominant narrative. But he believes this is not a problem of bias but rather a result of the availability of information on the Internet.

Umair was asked why the chatbot provided more detailed answers about Khan. He says it's crucial to realize that no one forgets the information. "Naturally, the area that contains the most information will attract the AI chatbot. This is information bias and is common in general chatbots. Specialized chatbots, however, are carefully tailored. So, if I develop a political chatbot for, say, South Asia or Pakistan, I will control the data and be able to ensure that similar datasets are fed into the system. In general chatbots, the data sets are vast and the AI will naturally lean towards the side that has a good amount of information."

Khilji adds: "Artificial intelligence is based on machine learning, and machine learning depends on the vastness of the data set, so the kind of data you have online is what is gathered by these chatbots. Because there is a large bias in reporting on countries outside the US and Western Europe (or the non-English-speaking world), this information is not accurate or detailed enough, and the information you get from the chatbot will reflect this lack of precision."

The News reached out to Y Combinator founder Paul Graham to find out if people's opinions influence chatbot responses. He says: "Not only do current AIs not reflect the median opinion of the world, they don't even reflect the median opinion of the United States. If we had to identify a group whose views they most accurately reflect, perhaps it would be humanities professors at elite American universities."

And to ensure transparency, it makes sense to put the question directly to the chatbot. The News asked Gemini whether it was politically biased. Gemini's response: "I strive to remain politically impartial in my responses. However, it is crucial to recognize that, like any large language model, I am likely to inherit biases present in my training data.

"This data encompasses a wide and diverse range of information, including political viewpoints from across the spectrum. However, complete neutrality is a challenge due to: a) data imbalances: certain viewpoints may be over-represented in the training data compared to others, potentially leading to an unintended bias towards these dominant perspectives, and b) the complexity of language: identifying and eliminating all potential biases in each linguistic nuance is a complex task."

According to Gemini, although it strives for neutrality, remaining completely free of bias requires constant effort and is an area of ongoing research.




