The AI War: Ethical Struggles and Technological Challenges in the Era of Language Models

The technological battle for AI dominance took a significant leap forward at the Google I/O event. Google announced the forthcoming integration of its latest language model, PaLM 2, into several of its products, including Google Docs, Sheets, and Slides. While Google has been relatively late to join the large language model (LLM) integration race, preferring a more cautious approach, intense competition from OpenAI and Microsoft is compelling it to accelerate its efforts in this field.

The AI war has been raging since ChatGPT was released to the general public. As with any major technological shift, the advance of LLMs brings collateral impacts; here, the primary concerns are ethics and safety. LLMs are prone to "hallucination": they can generate factually incorrect statements, posing challenges to information integrity and responsible use. Without proper controls, they can also produce ethically problematic text, such as instructions for weapon-making or material promoting hateful ideologies.

The debate on the ethics of LLMs is complex as it ultimately concerns our personal and collective values. OpenAI has faced criticism for its ethical controls, with some claiming that ChatGPT is "woke" – in other words, biased towards left-wing ideas and against right-wing ones. These accusations echo those leveled against social media platforms such as Twitter and Facebook, even though numerous studies have debunked such bias.

Alongside the ethical debates, another technological challenge looms: the ever-increasing size of large language models. For a long time, the dominant belief was that a language model's size directly correlated with its capability and its potential for emergent behavior. GPT-4 was built on this premise, and it has indeed proven more capable than its predecessor, GPT-3, exhibiting superior performance in areas such as reasoning and conciseness and lending weight to the hypothesis that larger models deliver stronger results.
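To make that size-performance intuition concrete, here is a minimal sketch of the compute-optimal scaling law from the "Chinchilla" paper (Hoffmann et al., 2022). The constants are that paper's published fits, used purely for illustration; they imply nothing about how GPT-4 was actually trained.

```python
# A minimal sketch of the Chinchilla scaling law (Hoffmann et al., 2022):
# predicted loss as a function of parameter count N and training tokens D.
# The constants below are the paper's published fits, shown only to
# illustrate the "bigger is better, with diminishing returns" trend.

def chinchilla_loss(n_params: float, n_tokens: float) -> float:
    E, A, B = 1.69, 406.4, 410.7     # fitted constants from the paper
    alpha, beta = 0.34, 0.28         # fitted exponents
    return E + A / n_params**alpha + B / n_tokens**beta

# Loss keeps dropping as models grow, but each 10x buys less:
for n in (1e9, 10e9, 100e9, 1000e9):   # 1B to 1T parameters
    print(f"{n:>8.0e} params -> predicted loss {chinchilla_loss(n, 300e9):.3f}")
```

The curve keeps improving with scale, but with visibly diminishing returns, which is precisely what motivates the efficiency-minded approach described next.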

Yet in the open-source community, an alternative approach has been gaining momentum. The successful training of smaller models that deliver performance comparable to GPT-3's has sparked growing interest in a philosophy of efficiency: achieving more with less. Smaller models are easier and cheaper to train. Some can even be trained on a personal computer, opening up a multitude of applications for developers, as the sketch below illustrates.
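As an example of how low the barrier has become, here is a minimal sketch of running a compact open model locally with the Hugging Face transformers library; distilgpt2 is just a stand-in for any small open model.

```python
# Running a compact open model on an ordinary machine, using the
# Hugging Face `transformers` library (pip install transformers torch).
# distilgpt2 (~82M parameters) stands in for any small open model.
from transformers import pipeline

generator = pipeline("text-generation", model="distilgpt2")

result = generator(
    "Smaller language models are attractive because",
    max_new_tokens=40,        # cap the length of the completion
    num_return_sequences=1,
)
print(result[0]["generated_text"])
```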

However, large models suffer from a risk of overfitting: it is hard to know whether they have simply memorized their text corpus or genuinely developed an adequate general representation.
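One rough diagnostic for memorization is to compare a model's perplexity on text it has likely seen during training with its perplexity on text it cannot have seen; a large gap points toward memorization rather than generalization. A minimal sketch, assuming distilgpt2 and two toy strings in place of real corpus samples:

```python
# Rough memorization diagnostic: a model that merely memorized its corpus
# shows much lower perplexity on training text than on unseen text.
# Sketch only; `seen` / `unseen` would be real corpus samples in practice.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("distilgpt2")
model = AutoModelForCausalLM.from_pretrained("distilgpt2")
model.eval()

def perplexity(text: str) -> float:
    ids = tokenizer(text, return_tensors="pt").input_ids
    with torch.no_grad():
        # Passing labels=input_ids makes the model return the mean
        # next-token cross-entropy loss; exp(loss) is the perplexity.
        loss = model(ids, labels=ids).loss
    return torch.exp(loss).item()

seen = "To be, or not to be, that is the question."        # likely in training data
unseen = "Zorblat quindles the mirthful cryostat weekly."  # nonsense, surely unseen
print(f"perplexity(seen)   = {perplexity(seen):.1f}")
print(f"perplexity(unseen) = {perplexity(unseen):.1f}")
```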

As more and more online text is generated by LLMs, language standardization becomes a concern: a model trained to predict the most probable next word will tend to reproduce the most typical phrasing. LLMs, after all, do not have the capacity to reason. They are merely machines predicting the next word from the preceding sequence. This is a crucial difference from human intelligence, which doesn't need to know every word in the dictionary to reason in a language.
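That next-word mechanism fits in a few lines. Here is a minimal sketch of greedy decoding with the transformers library, with distilgpt2 once again standing in for any causal language model:

```python
# "Predicting the next word": a bare greedy decoding loop showing what an
# LLM does at inference time, one token at a time.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("distilgpt2")
model = AutoModelForCausalLM.from_pretrained("distilgpt2")
model.eval()

ids = tokenizer("The capital of France is", return_tensors="pt").input_ids
for _ in range(5):
    with torch.no_grad():
        logits = model(ids).logits           # scores for every vocabulary token
    next_id = logits[0, -1].argmax()         # take the single most likely token
    ids = torch.cat([ids, next_id.view(1, 1)], dim=1)

print(tokenizer.decode(ids[0]))              # prompt plus five predicted tokens
```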

Ultimately, LLMs may be seen as a sort of language database and an interface for querying human textual thought. They are not truly intelligent, but they are capable of mimicking intelligence by leveraging vast volumes of text.

However, this doesn't mean that LLMs are without value. On the contrary, they can be extremely useful for tasks such as drafting summaries, translating, generating standardized text, and even generating code from natural language. The most promising aspect of LLMs may well be their integration with other tools, transforming traditional window-, form-, and button-based interfaces into conversational, natural-language interfaces.
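To illustrate that last point, here is a minimal sketch of the pattern: a model, hypothetically prompted to reply with JSON naming a tool and its arguments, replaces the form, and ordinary code dispatches the call. No real LLM API is invoked here; the model's reply is hard-coded for the example.

```python
# Sketch of the "conversational interface" pattern: the user types a
# request, a model turns it into a structured call, and plain code
# dispatches it. The JSON reply below is a hypothetical stand-in for the
# output of any LLM API prompted to answer in this format.
import json

def book_meeting(day: str, time: str) -> str:
    return f"Meeting booked on {day} at {time}."

def set_reminder(text: str) -> str:
    return f"Reminder set: {text}"

TOOLS = {"book_meeting": book_meeting, "set_reminder": set_reminder}

def handle(llm_reply: str) -> str:
    call = json.loads(llm_reply)      # model's structured output
    fn = TOOLS[call["tool"]]          # route to the real function
    return fn(**call["args"])

# Stand-in for a model's reply to "book me a meeting Friday at 10":
print(handle('{"tool": "book_meeting", "args": {"day": "Friday", "time": "10:00"}}'))
```

The buttons and menus don't disappear; they simply move behind a dispatcher that the user reaches through language.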

The AI war has indeed begun, and it brings with it a host of ethical and technological challenges. Advances in LLMs are inevitable, but it is critical that they be framed by rigorous ethical debate and a deep understanding of their technological implications. Ultimately, the goal should be to develop tools that enhance people's lives while respecting their values and minimizing potential risks.

Striking this delicate balance is the challenge facing AI researchers in the era of language models. And while there's still a long way to go, the excitement of what the future might bring is undeniable.