The Battle of Chatbot Giants: An Experimental Comparison of ChatGPT and Bard

Kabakuş, Abdullah Talha; Dogru, İbrahim

The Battle of Chatbot Giants: An Experimental Comparison of ChatGPT and Bard

dc.contributor.author	Kabakuş, Abdullah Talha
dc.contributor.author	Dogru, İbrahim
dc.date.accessioned	2025-03-24T19:47:28Z
dc.date.available	2025-03-24T19:47:28Z
dc.date.issued	2024
dc.department	Düzce Üniversitesi
dc.description.abstract	Nowadays, it is hard to find a part of human life that Artificial Intelligence (AI) has not been involved in. With the recent advances in AI, the change for chatbots has been an ‘evolution’ instead of a ‘revolution’. AI-powered chatbots have become an integral part of customer services as they are as functional as humans (if not more), and they can provide 24/7 service (unlike humans). There are several publicly available, widely used AI-powered chatbots. So, “Which one is better?” is a question that instinctively comes to mind and needs to be shed light on. Motivated by the question, an experimental comparison of two widely used AI-powered chatbots, namely ChatGPT and Bard, was proposed in this study. For a quantitative comparison, (i) a gold standard QA dataset, which comprised 2.390 questions from 109 topics, was used, and (ii) a novel answer-scoring algorithm was proposed. The covered chatbots were evaluated using the proposed algorithm on the dataset to reveal their (i) generated answer length, and (ii) generated answer accuracy. According to the experimental results, (i) Bard generated lengthy answers compared to ChatGPT, and (ii) Bard provided answers more similar to the ground truth compared to ChatGPT.
dc.description.abstract	Nowadays, it is hard to find a part of human life that Artificial Intelligence (AI) has not been involved in. With the recent advances in AI, the change for chatbots has been an ‘evolution’ instead of a ‘revolution’. AI-powered chatbots have become an integral part of customer services as they are as functional as humans (if not more), and they can provide 24/7 service (unlike humans). There are several publicly available, widely used AI-powered chatbots. So, “Which one is better?” is a question that instinctively comes to mind and needs to shed light on. Motivated by the question, an experimental comparison of two widely used AI-powered chatbots, namely ChatGPT and Bard, was proposed in this study. For a quantitative comparison, (i) a gold standard QA dataset, which comprised 2,390 questions from 109 topics, was used and (ii) a novel answer-scoring algorithm based on cosine similarity was proposed. The covered chatbots were evaluated using the proposed algorithm on the dataset to reveal their (i) generated answer length and (ii) generated answer accuracy. According to the experimental results, (i) Bard generated lengthy answers compared to ChatGPT and (ii) Bard provided answers more similar to the ground truth compared to ChatGPT.
dc.identifier.doi	10.29137/umagd.1390083
dc.identifier.endpage	691
dc.identifier.issn	1308-5514
dc.identifier.issue	2
dc.identifier.startpage	679
dc.identifier.uri	https://doi.org/10.29137/umagd.1390083
dc.identifier.uri	https://hdl.handle.net/20.500.12684/18756
dc.identifier.volume	16
dc.language.iso	en
dc.publisher	Kirikkale University
dc.relation.ispartof	International Journal of Engineering Research and Development
dc.relation.publicationcategory	Makale - Ulusal Hakemli Dergi - Kurum Öğretim Elemanı
dc.rights	info:eu-repo/semantics/openAccess
dc.snmz	KA_DergiPark_20250324
dc.subject	chatbot\|question answering\|artificial intelligence\|ChatGPT\|Bard\|geniş dil modeli\|chatbot\|question answering\|artificial intelligence\|ChatGPT\|Bard\|large language model
dc.title	The Battle of Chatbot Giants: An Experimental Comparison of ChatGPT and Bard
dc.title.alternative	The Battle of Chatbot Giants: An Experimental Comparison of ChatGPT and Bard
dc.type	Article

Koleksiyon

Öksüz Yayınlar Koleksiyonu

The Battle of Chatbot Giants: An Experimental Comparison of ChatGPT and Bard

Dosyalar

Koleksiyon