Bayes Centre

Understand word embeddings using tidy data principles

Modern NLP frameworks often depend on word embeddings, a way of statistically modeling language where words or phrases are mapped to vectors of real numbers. In this talk, we’ll work to understand word embeddings by investigating how we can generate them using count-based statistics and dimensionality reduction, then learn how to make use of pre-trained embeddings based on enormous datasets. Finally, we’ll explore the ethical issues involved in using word embeddings and how they can amplify systemic and historical bias. A local chapter of R-Ladies Global, R-Ladies Edinburgh exists to promote gender diversity in the R community worldwide. We are pro-actively inclusive of queer, trans, and all minority identities, with additional sensitivity to intersectional identities. Our priority is to provide a safe community space for anyone identifying as a minority gender who is interested in and/or working with R. As a founding principle, there is no cost or charge to participate in any of our R-Ladies communities around the world. R-Ladies events are open to all, however if you do not identify as a minority gender, we ask that you come along as the guest of someone who does.

Please review our code of conduct before attending the event.

Sign Up

Apr 02 2020 -

Understand word embeddings using tidy data principles

For our April meetup we're so very lucky to have Julia Silge join us! Julia is a data scientist and software engineer at RStudio where she works on open-source modeling tools. She is also the co-author of Tidy Text Mining with R .

G.03
Bayes Centre
47 Potterrow
Edinburgh
EH8 9BT