Enriching Contextual Word Embeddings With Character Information

Polatbilek, Ozan

Please use this identifier to cite or link to this item: https://hdl.handle.net/11147/10984

Full metadata record

DC Field	Value	Language
dc.contributor.advisor	Tekir, Selma	-
dc.contributor.author	Polatbilek, Ozan	-
dc.date.accessioned	2021-07-04T09:36:05Z	-
dc.date.available	2021-07-04T09:36:05Z	-
dc.date.issued	2020-07	-
dc.identifier.citation	Polatbilek, O. (2020). Enriching contextual word embeddings with character information. Unpublished master's thesis, İzmir Institute of Technology, İzmir, Turkey	en_US
dc.identifier.uri	https://hdl.handle.net/11147/10984	-
dc.description	Thesis (Master)--Izmir Institute of Technology, Computer Engineering, Izmir, 2020	en_US
dc.description	Includes bibliographical references (leaves: 58-64)	en_US
dc.description	Text in English; Abstract: Turkish and English	en_US
dc.description.abstract	Natural Language Processing has become more and more popular with the recent advances in Artificial Intelligence. Fundamental improvements have been introduced in word representations to store semantic and/or syntactic features. With the recently published language model BERT, contextual word vectors could be generated. This model do not process character level information. In morphologically rich languages such as Turkish, this model's perception of syntax could be improved. In this thesis, a new model, called BERT-ELMo, which is a combination of BERT and ELMo, is proposed to enrich BERT with character level information. This model combines character level processing part of ELMo and contextual word representation part of the BERT model. To show the effectiveness of the proposed model, both quantitative (question answering) and qualitative (word analogy, word contextualization, morphological meaning, out of vocabulary word capturing) analyses are performed and it is compared with BERT on Turkish language. Thanks to character level addition, proposed model is able get trained in any language without any pre-analysis.To the best of our knowledge, this is the first study which uses morphological analysis to train the BERT model in Turkish, and the first model to integrate a character level module to BERT.	en_US
dc.description.abstract	Doğal dil işleme, günümüzdeki yapay zeka gelişmelerinin de yardımıyla popülerlik kazanmıştır. Bu popülerlik sayesinde farklı alt alanlarda yüksek başarımlı çalışmalar yayınlanmaktadır. Bu alt alanlardan birisi de doğal dil işlemenin temel problemlerinden birini ele alan kelimelerin vektörel gösterimi alanıdır. Kelimelerin sözdizimi ve anlamsal bilgilerini ihtiva etmesi amacıyla vektörel gösterim kullanılmaktadır. Yakın zamanda yayınlanan BERT modeli ile kelime vektörleri bağlamsal olarak yüksek bir başarım ile gösterilebilmektedir. Öte yandan bu model sözdizimi kurallarını gözlemleyecek karakter seviyesinde bir yapı içermediğinden, Türkçe gibi morfolojik olarak zengin dillerde istenen sonuçları veremeyebilmektedir. Bu çalışmada, BERT modelinin karakter seviyesinde de bilgi işleyebilmesini sağlamak amacıyla, BERT-ELMo modeli önerilmiştir. Bu yeni önerilen modelde, ELMo modelinin karakter seviyesinde işlem yapabilen modülü ile BERT modelinin bağlamsal kelime vektörü üreten modülü birleştirilmiştir. BERT-ELMo modeli hem nicel and nitel analizler ile incelenmiş ve BERT modeli ile Türkçe dili üzerinden karşılaştırılmıştır. Karakter seviyesindeki ekleme sebebiyle, önerilen model herhangi bir dilde herhangi bir ön çalışma yapılmaksızın çalıştırılabilir. Bilinen kadarıyla, bu çalışma Türkçe morfolojik analizi ile BERT modelini eğiten ve BERT modeline karakter seviyesinde bir modül eklemeyi deneyen ilk çalışmadır.	en_US
dc.description.sponsorship	TUBITAK (117E747)	en_US
dc.format.extent	x, 79 leaves	-
dc.language.iso	en	en_US
dc.publisher	Izmir Institute of Technology	en_US
dc.rights	info:eu-repo/semantics/openAccess	en_US
dc.subject	Artificial intelligence methods	en_US
dc.subject	Natural language processing	en_US
dc.subject	BERT model	en_US
dc.title	Enriching Contextual Word Embeddings With Character Information	en_US
dc.title.alternative	Bağlamsal Kelime Gömmelerinin Karakter Bilgisi ile Zenginleştirilmesi	en_US
dc.type	Master Thesis	en_US
dc.authorid	0000-0001-5556-7360	-
dc.department	Thesis (Master)--İzmir Institute of Technology, Computer Engineering	en_US
dc.relation.tubitak	info:eu-repo/grantAgreement/TUBITAK/EEEAG/117E747	-
dc.relation.publicationcategory	Tez	en_US
dc.identifier.wosquality	N/A	-
dc.identifier.scopusquality	N/A	-
item.openairecristype	http://purl.org/coar/resource_type/c_18cf	-
item.languageiso639-1	en	-
item.openairetype	Master Thesis	-
item.grantfulltext	open	-
item.fulltext	With Fulltext	-
item.cerifentitytype	Publications	-
Appears in Collections:	Master Degree / Yüksek Lisans Tezleri

Files in This Item:

File	Size	Format
10349752.pdf	1.7 MB	Adobe PDF	View/Open

Show simple item record

CORE Recommender

Page view(s)

402

checked on Mar 31, 2025

Download(s)

350

checked on Mar 31, 2025

Google Scholar^TM

Check

Files in This Item:

Page view(s)

Download(s)

Google ScholarTM

Google Scholar^TM