Open Access. Powered by Scholars. Published by Universities.®

Linguistics Commons

Open Access. Powered by Scholars. Published by Universities.®

6,222 Full-Text Articles 5,647 Authors 2,999,454 Downloads 212 Institutions

All Articles in Linguistics

Faceted Search

6,222 full-text articles. Page 5 of 180.

Improving Low-Resource Morphological Learning With Intermediate Forms From Finite State Transducers, Sarah Moeller, Ghazaleh Kazeminejad, Andrew Cowell, Mans Hulden 2019 University of Colorado

Improving Low-Resource Morphological Learning With Intermediate Forms From Finite State Transducers, Sarah Moeller, Ghazaleh Kazeminejad, Andrew Cowell, Mans Hulden

Proceedings of the Workshop on Computational Methods for Endangered Languages

Neural encoder-decoder models are usually applied to morphology learning as an end-to-end process without considering the underlying phonological representations that linguists posit as abstract forms before morphophonological rules are applied. Finite State Transducers for morphology, on the other hand, are developed to contain these underlying forms as an intermediate representation. This paper shows that training a bidirectional two-step encoder-decoder model of Arapaho verbs to learn two separate mappings between tags and abstract morphemes and morphemes and surface allomorphs improves results when training data is limited to 10,000 to 30,000 examples of inflected word forms.


Using Computational Approaches To Integrate Endangered Language Legacy Data Into Documentation Corpora: Past Experiences And Challenges Ahead, Rogier Blokland, Niko Partanen, Michael Rießler, Joshua Wilbur 2019 Uppsala University

Using Computational Approaches To Integrate Endangered Language Legacy Data Into Documentation Corpora: Past Experiences And Challenges Ahead, Rogier Blokland, Niko Partanen, Michael Rießler, Joshua Wilbur

Proceedings of the Workshop on Computational Methods for Endangered Languages

No abstract provided.


A Preliminary Plains Cree Speech Synthesizer, Atticus Harrigan, Timothy Mills, Antti Arppe 2019 University of Alberta

A Preliminary Plains Cree Speech Synthesizer, Atticus Harrigan, Timothy Mills, Antti Arppe

Proceedings of the Workshop on Computational Methods for Endangered Languages

This paper discusses the development and evaluation of a Speech Synthesizer for Plains Cree, an Algonquian language of North America. Synthesis is achieved using Simple4All and evaluation was performed using a modified Cluster Identification, Semantically Unpredictable Sentence, and a basic dichotomized judgment task. Resulting synthesis was not well received; however, observations regarding the process of speech synthesis evaluation in North American indigenous communities were made: chiefly, that tolerance for variation is often much lower in these communities than for majority languages. The evaluator did not recognize grammatically consistent but semantically nonsense strings as licit language. As a result, monosyllabic clusters ...


A Biscriptual Morphological Transducer For Crimean Tatar, Francis M. Tyers, Jonathan N. Washington, Darya Kavitskaya, Memduh Gökırmak, Nick Howell, Remziye Berberova 2019 Indiana University

A Biscriptual Morphological Transducer For Crimean Tatar, Francis M. Tyers, Jonathan N. Washington, Darya Kavitskaya, Memduh GöKırmak, Nick Howell, Remziye Berberova

Proceedings of the Workshop on Computational Methods for Endangered Languages

This paper describes a weighted finite-state morphological transducer for Crimean Tatar able to analyse and generate in both Latin and Cyrillic orthographies. This transducer was developed by a team including a community member and language expert, a field linguist who works with the community, a Turkologist with computational linguistics expertise, and an experienced computational linguist with Turkic expertise.

Dealing with two orthographic systems in the same transducer is challenging as they employ different strategies to deal with the spelling of loan words and encode the full range of the language's phonemes and their interaction. We develop the core transducer ...


An Online Platform For Community-Based Language Description And Documentation, Rebecca Everson, Wolf Honoré, Scott Grimm 2019 Independent

An Online Platform For Community-Based Language Description And Documentation, Rebecca Everson, Wolf Honoré, Scott Grimm

Proceedings of the Workshop on Computational Methods for Endangered Languages

We present two pieces of interlocking technology in development to facilitate community-based, collaborative language description and documentation: (i) a mobile app where speakers submit text, voice recordings and/or videos, and (ii) a community language portal that organizes submitted data and provides question/answer boards whereby community members can evaluate/supplement submissions.


A Software-Driven Workflow For The Reuse Of Language Documentation Data In Typological Studies, Stephan Druskat, Kilu von Prince 2019 Humboldt-Universität zu Berlin

A Software-Driven Workflow For The Reuse Of Language Documentation Data In Typological Studies, Stephan Druskat, Kilu Von Prince

Proceedings of the Workshop on Computational Methods for Endangered Languages

No abstract provided.


Digital Dictionary Development For Torwali, A Less-Studied Language: Process And Challenges, Inam Ullah 2019 Torwali Research Forum

Digital Dictionary Development For Torwali, A Less-Studied Language: Process And Challenges, Inam Ullah

Proceedings of the Workshop on Computational Methods for Endangered Languages

No abstract provided.


La Syntaxe Et L'Interprétation Des Questions Coordonnées, Mikalai Kliashchuk 2019 The University of Western Ontario

La Syntaxe Et L'Interprétation Des Questions Coordonnées, Mikalai Kliashchuk

Electronic Thesis and Dissertation Repository

Cette thèse traite des questions coordonnées dans les langues slaves et en roumain. Nous considérons trois approches proposées dans la littérature (approche monopropositionnelle, approche à écluse, approche mixte à structures multidominantes) et concluons que l’analyse en termes des structures multidominantes de Citko et Gračanin-Yuksek (2013) est la meilleure à rendre compte des interrogatives coordonnées à travers les langues. Nous montrons que la seule structure nécessaire pour dériver les interrogatives coordonnées dans toutes les langues slaves et en roumain est la structure multidominante de partage en gros. Cette structure est disponible uniquement dans les langues au mouvement-QU multiple. Elle implique ...


Project On Designing Activities For Teaching Grammar, Sunny Hyon 2019 California State University, San Bernardino

Project On Designing Activities For Teaching Grammar, Sunny Hyon

Q2S Enhancing Pedagogy

This document outlines a project for a course in second language acquisition taken by undergraduates who intend to be high school or university teachers/professors. The project asks the undergraduates to design a sequence of activities for teaching English language learners (ELLs) a particular English grammatical construction, and to connect those activities to reading or writing assignments in their future classrooms. The students also must explain how their activities will facilitate ELLs' grammatical learning, drawing on research on language acquisition presented in the course.


Techniques And Methods Change, Methodology Remains The Same: Web Technology Use As Cosmetic Change In Cfl Classrooms, Justin Olmanson, Xianquan Liu, Nannan Wang, Christopher Heselton 2019 University of Nebraska-Lincoln

Techniques And Methods Change, Methodology Remains The Same: Web Technology Use As Cosmetic Change In Cfl Classrooms, Justin Olmanson, Xianquan Liu, Nannan Wang, Christopher Heselton

Chinese Language Teaching Methodology and Technology

Technology has been a staple in the language classroom for more than fifty years. From audio cassettes, to video tapes, to multimedia CD-ROMs, to static and interactive web technologies, language teachers have taken the time to learn these tools and integrate them into the classroom experience. Each new technology, each new app, creates an opportunity to alter the experience of teaching and learning—often supporting increased authentic interactions with the Chinese language. This potential however can go unrealized when the ways the technologies are used align with more traditional grammar and vocabulary-focused teaching methodologies and promote efficiency over communication. In ...


Mandarin Assessment In Chinese-English Bilingual Preschoolers, Jennifer A. Chard 2019 The Graduate Center, City University of New York

Mandarin Assessment In Chinese-English Bilingual Preschoolers, Jennifer A. Chard

All Dissertations, Theses, and Capstone Projects

Immigrant children who grow up in linguistically and culturally diverse households are at risk for misdiagnosis for language impairment and inappropriate placement in or exclusion from special education classes. Research shows that native language testing is essential in determining eligibility for disability services, as reflected both in federal law (Individuals with Disabilities Education Improvement Act of 2004). However, despite growing agreement that native language assessment is a critical component to understanding the abilities and challenges bilingual students face, the standard assessments currently used are largely administered in Standard English and normed on monolingual English speakers. Few options are available to ...


Generative Adversarial Networks And Word Embeddings For Natural Language Generation, Robert D. Schultz Jr 2019 The Graduate Center, City University of New York

Generative Adversarial Networks And Word Embeddings For Natural Language Generation, Robert D. Schultz Jr

All Dissertations, Theses, and Capstone Projects

We explore using image generation techniques to generate natural language. Generative Adversarial Networks (GANs), normally used for image generation, were used for this task. To avoid using discrete data such as one-hot encoded vectors, with dimensions corresponding to vocabulary size, we instead use word embeddings as training data. The main motivation for this is the fact that a sentence translated into a sequence of word embeddings (a “word matrix”) is an analogue to a matrix of pixel values in an image. These word matrices can then be used to train a generative adversarial model. The output of the model’s ...


Obfuscating Authorship: Results Of A User Study On Nondescript, A Digital Privacy Tool, Robin Camille Davis 2019 CUNY John Jay College

Obfuscating Authorship: Results Of A User Study On Nondescript, A Digital Privacy Tool, Robin Camille Davis

Publications and Research

For those who write anonymously, particularly for safety reasons, authorship attribution poses a threat. Nondescript, my web app, guides writers in achieving stylometric obfuscation in order to preserve anonymity. The app runs simulations of authorship attribution scenarios by analyzing the user’s linguistic features. In this paper, I will describe the conception of the Nondescript app; discuss related work; and present the results of a user study. Most users in the study were able to anonymize their writing in at least 5 out of 10 authorship attribution scenarios. Users rated the anonymization process an average of 3.6 out of ...


Hrothgar And Wealhtheow: An Onomastic Approach To A Story Of Good Governance, D. Marie Nelson 2019 University of Florida

Hrothgar And Wealhtheow: An Onomastic Approach To A Story Of Good Governance, D. Marie Nelson

Journal of Literary Onomastics

No abstract.


Place-Names And Politics In The Awntyrs Off Arthure, Andrew Breeze 2019 Universidad de Navarra, Pamplona

Place-Names And Politics In The Awntyrs Off Arthure, Andrew Breeze

Journal of Literary Onomastics

No abstract.


Multilingual Moomins: Examining The Translation Of Tove Jansson’S Nonsense Character Names From Swedish To English And Finnish, Janine A E Strandberg 2019 University of Groningen

Multilingual Moomins: Examining The Translation Of Tove Jansson’S Nonsense Character Names From Swedish To English And Finnish, Janine A E Strandberg

Journal of Literary Onomastics

No abstract.


Permanent Functions Of Characters’ Proper Names In Harry Potter, Martyna Gibka 2019 Koszalin University of Technology

Permanent Functions Of Characters’ Proper Names In Harry Potter, Martyna Gibka

Journal of Literary Onomastics

No abstract.


Defining A Practicum In Applied Linguistics Through Duoethnography To Comply With The National Postgraduate Audit Culture In Mexico, Troy Crawford, Edgar Emmanuell Garcia-Ponce 2019 University of Guanajuato

Defining A Practicum In Applied Linguistics Through Duoethnography To Comply With The National Postgraduate Audit Culture In Mexico, Troy Crawford, Edgar Emmanuell Garcia-Ponce

The Qualitative Report

In this article, we attempt to conciliate the issues of an audit culture that surround the postgraduate programs in Mexico through the National Council of Science and Technology (CONACyT) with highly flexible definition of the course using duoethnography. The goal being to satisfy national requirements yet retain freedom in applied research in English as a foreign language teaching. This is done through an intense process of data recycling with the student participants in a semester-long analysis. This analysis also involved a continuous editing and reediting process to try and connect all the participants’ autobiographical dots of the course to arrive ...


Text-Speech Alignment: A Robin Hood Approach For Endangered Languages, Claire Bowern, Rikker Dockum, Sarah Babinski, Hunter Craft, Anelisa Fergus, Dolly Goldenberg 2019 Yale University

Text-Speech Alignment: A Robin Hood Approach For Endangered Languages, Claire Bowern, Rikker Dockum, Sarah Babinski, Hunter Craft, Anelisa Fergus, Dolly Goldenberg

Yale Day of Data

Forced alignment automatically aligns audio recordings of spoken language with transcripts at the level of individual sounds, greatly reducing the time required to prepare data for linguistic analysis. However, existing algorithms are mostly trained on a few well-documented languages. We test the performance of three algorithms against manually aligned data on data from a highly endangered language. At least some tasks, unsupervised alignment (either based on English or trained from a small corpus) is sufficiently reliable for it to be used on legacy data for low-resource languages. Descriptive phonetic work on vowel inventories and prosody can be accurately captured by ...


Structural Justice: A Critical Feminist Framework Exploring The Intersection Between Justice, Equity And Structural Reconciliation., Camille Burnett, Michael Swanberg, Ashley Hudson, Donna Schminkey 2019 University of Virginia School of Nursing

Structural Justice: A Critical Feminist Framework Exploring The Intersection Between Justice, Equity And Structural Reconciliation., Camille Burnett, Michael Swanberg, Ashley Hudson, Donna Schminkey

Journal of Health Disparities Research and Practice

Violence against women is a human rights violation (UN, 2006). It affects the health of women globally (UN, 2009) and its elimination is at the heart of many international and national goals. Intimate partner violence (IPV), one of the most common forms of gender-based violence, affects one in three women worldwide (WHO, 2013). The consequences of IPV create negative health outcomes for women that diminish their quality of life and their overall well-being. Abused women access community supports such as shelters to seek safe refuge from the abuse and restore their lives. While shelters play an extensive role in helping ...


Digital Commons powered by bepress