• Home
  • About
  • Policies
  • Contact
    • Türkçe
    • English
  • English 
    • Türkçe
    • English
  • Login
Advanced Search
View Item 
  •   Home
  • Mühendislik Fakültesi / Faculty of Engineering
  • Bilgisayar Mühendisliği / Computer Engineering
  • Bildiriler, Kongreler ve Sempozyumlar / Declarations, Congresses and Symposiums
  • View Item
  •   Home
  • Mühendislik Fakültesi / Faculty of Engineering
  • Bilgisayar Mühendisliği / Computer Engineering
  • Bildiriler, Kongreler ve Sempozyumlar / Declarations, Congresses and Symposiums
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

A Comparative Study to Determine the Effective Window Size of Turkish Word Sense Disambiguation Systems

Thumbnail
Author
İlgen, Bahar
Adalı, Eşref
Tantuğ, Ahmet Cüneyd
Type
conferenceObject
Date
2013
Language
en_US
Metadata
Show full item record
Abstract
In this paper, the effect of different windowing schemes on word sense disambiguation accuracy is presented. Turkish Lexical SampleDataset has been used in the experiments. We took the samples of ambiguous verbs and nouns of the dataset and used bag-of-word properties as context information. The experi-ments have been repeated for different window sizes based on several machine learning algorithms. We follow 2/3 splitting strategy (2/3 for training, 1/3 for test-ing) and determine the most frequently used words in the training part. After re-moving stop words, we repeated the experiments by using most frequent 100, 75, 50 and 25 content words of the training data. Our findings show that the usage of most frequent 75 words as features improves the accuracy in results for Turkish verbs. Similar results have been obtained for Turkish nouns when we use the most frequent 100 words of the training set. Considering this information, selected al-gorithms have been tested on varying window sizes {30, 15, 10 and 5}. Our find-ings show that Naive Bayes and Functional Tree methods yielded better accuracy results. And the window size +/-5 gives the best average results both for noun and the verb groups. It is observed that the best results of the two groups are 65.8 and 56% points above the most frequent sense baseline of the verb and noun groups respectively.
Subject
Computer Science
Information Systems
Computer Science, Theory & Methods
Engineering, Electrical & Electronic
URI
https://doi.org/10.1007/978-3-319-01604-7_17
https://hdl.handle.net/11413/2032
Collections
  • Bildiriler, Kongreler ve Sempozyumlar / Declarations, Congresses and Symposiums [45]
  • Scopus Publications [724]
  • WoS Publications [1016]

İstanbul Kültür University

Hakkında |Politika | Kütüphane | İletişim | Send Feedback | Admin

Istanbul Kültür University, Ataköy Campus E5 Karayolu Üzeri Bakırköy 34158, İstanbul / TURKEY
Copyright © İstanbul Kültür University

Creative Commons Lisansı
IKU Institutional Repository, Creative Commons Alıntı-GayriTicari-Türetilemez 4.0 Uluslararası Lisansı ile lisanslanmıştır.

Designed by  UNIREPOS

İKU Kütüphane


Browse

All of DSpaceCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsTypeLanguageBy PublisherRightsPubmedScopusWoSThis CollectionBy Issue DateAuthorsTitlesSubjectsTypeLanguageBy PublisherRightsPubmedScopusWoS

My Account

Login

İstanbul Kültür University

Hakkında |Politika | Kütüphane | İletişim | Send Feedback | Admin

Istanbul Kültür University, Ataköy Campus E5 Karayolu Üzeri Bakırköy 34158, İstanbul / TURKEY
Copyright © İstanbul Kültür University

Creative Commons Lisansı
IKU Institutional Repository, Creative Commons Alıntı-GayriTicari-Türetilemez 4.0 Uluslararası Lisansı ile lisanslanmıştır.

Designed by  UNIREPOS