Doğruluk Problemi için Veri Kümesi Hazırlanması

Karabayır, Arif Kürşat; Tek, Ozan Onur; Çınar, Özgür Fırat; Tekir, Selma

Please use this identifier to cite or link to this item: https://hdl.handle.net/11147/7061

Title:	Doğruluk Problemi için Veri Kümesi Hazırlanması
Other Titles:	Construction of a Dataset for the Veracity Problem
Authors:	Karabayır, Arif Kürşat Tek, Ozan Onur Çınar, Özgür Fırat Tekir, Selma
Keywords:	Dataset Veracity problem WEB
Publisher:	CEUR Workshop Proceedings
Source:	Karabayır, A. K., Tek, O. O., Çınar, Ö. F. and Tekir, S. (2018, September 10-12). Doğruluk problemi için veri kümesi hazırlanması. In A. Tarhan and Murat E. (Eds.), Paper presented at 12th Turkish National Software Engineering Symposium, UYMS 2018; Istanbul; Turkey.
Abstract:	Internet has become one of the most important information sources. With the advent of Internet, the ease of access and sharing of information have caused the emergence of conflicting information. The increase in conflicting information makes it a challenge to find the truth out of it. This problem is named as the veracity problem. The algorithms that were developed in response to this problem accept structured data as in¬ put. Thus, to be able to use these algorithms on Internet, there is a need to transform the unstructured data on the Internet into a structured form. This need is hard to fulfill in a domain-independent and automatic way considering the variety on Internet. In this work; structured data preparation to test the effectiveness of the truth-finder algorithms is experienced. The process of transforming the unstructured data on the Internet into a structured form is described in steps to contribute its generalization in a domain-independent way. As a result of this process, a new quotes data set is constructed and a truth-finder algorithm is tested on this dataset by giving some comments on it. İnternet günümüzde en önemli bilgi kaynaklarından biri haline gelmiştir.İnternet ile birlikte, bilgiye ulaşımın ve paylaşın kolaylaşması, çelişkili bilgilerin açığa çıkmasına sebep olmuştur. Çelişkili bilgilerin artmasıyla, bunlar arasında doğru olanı bulmak da her geçen gün zorlaşmaktadır. Bu sorun literatürde doğruluk (veracity) problemi olarak tanımlanmıştır. Bu alanda geliştirilen algoritmalar girdi olarak yapısal veriyi kabul etmektedir. Bu algoritmaların internet üzerinde kullanılabilmesi için internetteki yapısal olmayan verinin yapısal forma dönüştürülmesi gerekmektedir. İnternet'teki verinin çeşitliliği düşünüldüğünde bu işin konudan bağımsız, otomatik olarak gerçekleştirilmesi zordur. Bu çalışmada doğruluk problemi üzerine geliştirilen algoritmaların sınanabilmesi için internetteki yapısal olmayan verilerin yapısal bir veri kümesine dönüştürülmesinde gerekli aşamalar belirlenip otomatize edilmesine katkı sağlanmıştır. Bu aşamalar kullanılarak örnek bir özdeyiş veri kümesi oluşturulmuş ve belirlenen bir doğruluk sınama algoritması bu veri kümesinde uygulanarak elde edilen sonuçclar yorumlanmıştır.
Description:	12th Turkish National Software Engineering Symposium, UYMS 2018; Istanbul; Turkey; 10 September 2018 through 12 September 2018
URI:	http://ceur-ws.org/Vol-2201/UYMS_2018_paper_84.pdf http://hdl.handle.net/11147/7061
ISSN:	1613-0073
Appears in Collections:	Computer Engineering / Bilgisayar Mühendisliği Scopus İndeksli Yayınlar Koleksiyonu / Scopus Indexed Publications Collection

Files in This Item:

File	Description	Size	Format
7061.pdf	Conference Paper	368.74 kB	Adobe PDF	View/Open

Show full item record

CORE Recommender

Page view(s)

408

checked on Apr 21, 2025

Download(s)

104

checked on Apr 21, 2025

Google Scholar^TM

Check

Files in This Item:

Page view(s)

Download(s)

Google ScholarTM

Google Scholar^TM