Measuring Historical Usage of Words Over Time -
Natural Languages contain a wealth of mathematical structures and it is a representation of a complex cultural system. Language can evolve due to multiple human-related factors such as migration, technological advances, and socio-political changes. Like a biological system, language is an evolving system with words as its measurable units. The frequencies of these words can change based on its functions within a written body of text. Using the Google n-gram English dataset of the Google Books Corpus, this presentation is an exploratory data analysis and statistical inference application of 500 years’ worth of written text. While introducing the basic statistical method of language modeling, this talk also includes applying language modeling to explore how words change over time.
Thursday, February 25, 2021 at 4:45pmVirtual Event
Reed Community Members
If you are a member of the Reed community, you MUST LOG IN to see events that are open ONLY to the Reed community. Log in with your Reed ID (your Kerberos account information). If you don’t remember your account username or password, go to reed.edu/cis/help/kerberos.html.Log in with Reed ID