Latest Robots

Monday, 3 October 2022

[New post] Lauren Fonteyn at LEL seminar

Site logo image manling posted: " The 2022-23 edition of the LEL seminar begins tomorrow (4th October) with a talk by Lauren Fonteyn (Leiden University). The talk will be at 4pm in Simon 1.34. The abstract and the title are below. Featured photo is from Lauren's website Mac" Manchet

Lauren Fonteyn at LEL seminar

manling

Oct 3

The 2022-23 edition of the LEL seminar begins tomorrow (4th October) with a talk by Lauren Fonteyn (Leiden University). The talk will be at 4pm in Simon 1.34. The abstract and the title are below.

Featured photo is from Lauren's website

MacBERTh & GysBERT:  using machine learning to automate  grammatical and semantic data annotation in historical corpora 

Lauren Fonteyn 

Leiden University 

In this talk, I will demonstrate how contextualized embeddings – which are a type of compressed token-based semantic vectors – can be used as annotation and research tools. More specifically, I will focus on the use of the Bidirectional Encoder Representations from Transformers model, also known as 'BERT' (Devlin et al. 2019). 

Originally, BERT was set up for Present-day English, having been pre-trained on 3.2 billion words of Present-day English Wikipedia and Google books data. Yet, researchers who interpret and analyse historical textual material are well aware that the interpretation of textual/linguistic material from the past should not be approached from a present-day point-of-view. Hence, NLP models pre-trained on present-day language data are less than ideal candidates for the job. For the case study presented in this paper, we use two variants of BERT called MacBERTh (Manjavacas and Fonteyn, 2021, 2022), which has been pre-trained on approximately 3.9B (tokenized) words of historical English (time span: 1450-1950), and GysBERT, which has been pre-trained on 7.1B (tokenized) words of historical Dutch (time span: 1500-1950). 

These models will be put into action in two different but thematically related case studies on individual-level language variation. The first case study, which focusses on variation and change in the use of English ing-forms by Early Modern English individuals, demonstrates how the models can be used to automate grammatical annotation. The second case study demonstrates how contextualized embeddings can be integrated into lexical diversity measures to allow us to not only consider the 'vocabulary richness' but also the 'semantic richness' of texts produced by different authors. 

Like

Unsubscribe to no longer receive posts from Manchet.
Change your email settings at manage subscriptions.

Trouble clicking? Copy and paste this URL into your browser:
https://manling.wordpress.com/2022/10/03/lauren-fonteyn-at-lel-seminar/

Powered by WordPress.com
Download on the App Store Get it on Google Play
at October 03, 2022
Email ThisBlogThis!Share to XShare to FacebookShare to Pinterest

No comments:

Post a Comment

Newer Post Older Post Home
Subscribe to: Post Comments (Atom)

[New post] Giants

...

  • [New post] Slotxo Internet based Picture slot machine game Fun-based activities That will Play
    Piddle Pops posted: "Discover these days more information on Slot machines performance Gear Financial businesses as well as...
  • [New post] Giants
    ...
  • [New post] Plutonium contamination in Ohio, USA
    Chri...
  • https://paxorex.blogspot.com
  • https://acciyo.blogspot.com
  • https://sunbrew.blogspot.com
  • https://readingvox.blogspot.com
  • https://neextdraft.blogspot.com
  • https://udimy.blogspot.com
  • https://arcieve.blogspot.com
  • https://diabetesmail.blogspot.com
  • https://quiltingmail.blogspot.com
  • https://downloadallyouwanttutorials.blogspot.com
  • https://increasingmarketingsystem.blogspot.com
  • https://skysportingnewsnationspinquirer.blogspot.com
  • https://politicnewsbusterinsiderpostreview.blogspot.com
  • https://javascripttrendlist.blogspot.com
  • https://teraqiitatail.blogspot.com
  • https://bigpalacenews.blogspot.com
  • https://executivetowernews.blogspot.com
  • https://magnificentplannews.blogspot.com
  • https://businessinboard.blogspot.com
  • https://patriotsscience.blogspot.com
  • https://allinonequantumleap.blogspot.com
  • https://foodandrecipefusion.blogspot.com
  • https://newsletterforeveryone.blogspot.com
  • https://snacksrobinhood.blogspot.com
  • https://dailynewslettersph.blogspot.com
  • https://rankedrama.blogspot.com
  • https://oschinanet.blogspot.com
  • https://nourich.blogspot.com
  • https://phnewsnet.blogspot.com
  • https://structuresusingc.blogspot.com
  • https://foodubers.blogspot.com
  • https://genuinequality.blogspot.com
  • https://techdigitalmedia.blogspot.com
  • https://entertainmenhubtbiz.blogspot.com
  • https://sportsbookwire.blogspot.com
  • https://societycast.blogspot.com
  • https://lifestylesportsreturn.blogspot.com
  • https://natureimpactfactor.blogspot.com
  • https://artnetworth.blogspot.com
  • https://entrepreneurexamples.blogspot.com
  • https://cryptomarketbase.blogspot.com
  • https://btsbiot.blogspot.com
  • https://sexybinikis.blogspot.com
  • https://foreignexchangecurrency.blogspot.com
  • https://classifiedexample.blogspot.com
  • https://bookboons.blogspot.com
  • https://writingdate.blogspot.com
  • https://wamios.blogspot.com
  • https://justmightdiy.blogspot.com
  • https://playfreeonlinegamesmore.blogspot.com
  • https://healthlinefitnessfirst.blogspot.com
  • https://snaptikvideodownloader.blogspot.com
  • https://pokemonunitepc.blogspot.com
  • https://neverthelesskdrama.blogspot.com
  • https://coolantioniq.blogspot.com
  • https://hackerploit.blogspot.com
  • https://ballbreakdown.blogspot.com
  • https://flixsterio.blogspot.com
  • https://fortnitebattleroyaletrack.blogspot.com
  • https://manilaplus.blogspot.com
  • https://davaoplus.blogspot.com
  • https://tutorialsfiles.blogspot.com
  • https://mondaymorningcookingclub.blogspot.com
  • https://gymnearmee.blogspot.com
  • https://windows26.blogspot.com
  • https://millionaireinvest.blogspot.com
  • https://latestkhmernews.blogspot.com
  • https://latestisraelnews.blogspot.com
  • https://latestaustralianews.blogspot.com
  • https://latestirannews.blogspot.com
  • https://latestjapannews.blogspot.com
  • https://latestsaudinews.blogspot.com
  • https://latestfreecourse.blogspot.com
  • https://ikeafurnitureaccessories.blogspot.com
  • https://makeupandbeautyproduct.blogspot.com
  • https://latestpets.blogspot.com
  • https://topecommerceniches.blogspot.com
  • https://latesttexasnews.blogspot.com
  • https://latestufcgame.blogspot.com
  • https://tipweightlossfast.blogspot.com
  • https://latestcancercure.blogspot.com
  • https://philsys.blogspot.com
  • https://phoramensoba.blogspot.com
  • https://latestcupcakes.blogspot.com
  • https://latestgivex.blogspot.com
  • https://latestlottoresult.blogspot.com
  • https://downloadarchived.blogspot.com
  • https://doesports.blogspot.com

Search This Blog

  • Home

About Me

latest robot
View my complete profile

Report Abuse

Blog Archive

  • October 2023 (1228)
  • September 2023 (1871)
  • August 2023 (1663)
  • July 2023 (1819)
  • June 2023 (1774)
  • May 2023 (1651)
  • April 2023 (1598)
  • March 2023 (1753)
  • February 2023 (1419)
  • January 2023 (1661)
  • December 2022 (1507)
  • November 2022 (1620)
  • October 2022 (1463)
  • September 2022 (1332)
  • August 2022 (1370)
  • July 2022 (1493)
  • June 2022 (1331)
  • May 2022 (1450)
  • April 2022 (1438)
  • March 2022 (1366)
  • February 2022 (958)
  • January 2022 (994)
  • December 2021 (1759)
  • November 2021 (3125)
  • October 2021 (3244)
  • September 2021 (3138)
  • August 2021 (3240)
  • July 2021 (1142)
Powered by Blogger.