
College of Arts and Letters Posters

Title

The Aki Yerushalayim Corpus: A Study of Loanwords in Ladino

Presenter and Co-Authors

Rachel McCullough

College

College of Arts & Letters

Department

English

Program

Applied Linguistics

Publication Date

4-2021

DOI

0000-0002-6942-6689 (McCullough)

Abstract

Ladino (or Judeo-Spanish) is a Diasporic Jewish language spoken by Sephardi Jews. There is little existing scholarly research on Ladino, and few language learning materials are available for it. These two factors compelled me to create the Aki Yerushalayim Corpus. The initial Aki Yerushalayim Corpus of Modern Written Ladino (currently ~7,000 words) was not created to act as a reference corpus of Modern Ladino. Rather, it was created to study the composition of Ladino prose and to demonstrate the utility of this type of project in the subdiscipline of language documentation. In addition, the project’s focus on cultural essays and narrative prose allows for insights into how Ladino writers construct their identities through word choice.

The research questions that directed the creation of the corpus are as follows: 1. What substrata feature most prominently in the Ladino lexicon? 2. Do the borrowed words belong to a particular semantic domain? 3. What parts of speech are most/least frequently borrowed?

To answer these questions, the gathered texts were first scanned with optical character recognition software (OwlOCR) and then proofread for mistakes. They were then tagged for part of speech and for language of origin and run through a concordancer (AntConc).

From this brief glimpse at Ladino prose in the late 20th and early 21st centuries, the results are as expected: the language is Spanish-based, and the most frequently used substrata are Biblical Hebrew and Turkish, primarily used for religious and secular/cultural words, respectively. This pilot gives a snapshot of how authors represent their identity as Sephardi Jews in both cultural essays and narrative prose. Further study of the distribution of these loanwords will allow linguists to understand how language contact and a speech community’s history can provide context that illuminates which loanwords are most frequently adopted and why.

Disciplines

Applied Linguistics | Jewish Studies

Recommended Citation

McCullough, Rachel, 'The Aki Yerushalayim Corpus: A Study of Loanwords in Ladino' (2021). College of Arts and Letters Posters. 5.
https://digitalcommons.odu.edu/gradposters2021_artsletters/5


DOWNLOADS

Since April 01, 2021

Included in

Applied Linguistics Commons, Jewish Studies Commons
