{"id":2776,"date":"2017-08-28T14:23:41","date_gmt":"2017-08-28T21:23:41","guid":{"rendered":"https:\/\/www.springboard.com\/?p=2776"},"modified":"2023-08-22T10:29:49","modified_gmt":"2023-08-22T17:29:49","slug":"introduction-word-embeddings","status":"publish","type":"post","link":"https:\/\/www.springboard.com\/blog\/data-science\/introduction-word-embeddings\/","title":{"rendered":"An Introduction to Word Embeddings"},"content":{"rendered":"\n<h2 class=\"wp-block-heading\">Understanding Word Embeddings<\/h2>\n\n\n\n<hr class=\"wp-block-separator has-css-opacity\"\/>\n\n\n\n<h3 class=\"wp-block-heading\"><em><strong>Part 1: Applications<\/strong><\/em><\/h3>\n\n\n\n<p><em><strong><span class=\"s3\">Written by Aaron Geelon So<\/span><\/strong><\/em><\/p>\n\n\n\n<p><em>If you already have a solid understanding of word embeddings and are well into your <a href=\"https:\/\/www.springboard.com\/courses\/data-science-career-track\/\" target=\"_blank\" data-type=\"URL\" data-id=\"https:\/\/www.springboard.com\/courses\/data-science-career-track\/\" rel=\"noreferrer noopener\">data science career<\/a>, skip ahead to the <a href=\"http:\/\/www.learndatasci.com\/intro-to-word-embeddings-problems-theory\/\" target=\"_blank\" rel=\"noopener\">next part<\/a>!<\/em><\/p>\n\n\n\n<p><span class=\"s4\">Human language is <span class=\"s6\">unreasonably effective<\/span> at describing how we relate to the world. With a few, short words, we can convey many ideas and actions with little ambiguity. Well, <span class=\"s6\">mostly<\/span>. <\/span><\/p>\n\n\n\n<p><span class=\"s5\"><span class=\"Apple-converted-space\">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; B<\/span>ecause we&#8217;re capable of seeing and describing so much complexity, a lot of structure is implicitly encoded into our language. It is no easy task for a computer (or a human, for that matter) to learn <i>natural language<\/i>, for it entails understanding how we humans observe the world, if not understanding how <i>to<\/i> observe the world. <\/span><\/p>\n\n\n\n<p><span class=\"s5\"><span class=\"Apple-converted-space\">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <\/span>For the most part, computers can&#8217;t understand natural language. Our programs are still line-by-line instructions telling a computer what to do &#8212; they often miss nuance and context. How can you explain sarcasm to a machine?&nbsp;<\/span><\/p>\n\n\n\n<p><span class=\"s5\"><span class=\"Apple-converted-space\">&nbsp;In the area of <a href=\"https:\/\/www.springboard.com\/blog\/data-science\/data-science-definition\/\" target=\"_blank\" rel=\"noreferrer noopener\">data science<\/a>, NLP has its own importance. &nbsp; &nbsp; &nbsp; &nbsp; <\/span>There&#8217;s good news though. There&#8217;s been some important breakthroughs in <i>natural language processing<\/i> (NLP), the domain where researchers try to teach computers human language.<\/span><\/p>\n\n\n\n<p><span class=\"s5\"><span class=\"Apple-converted-space\">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <\/span>Famously, in 2013 Google researchers (<em>Mikolov 2013<\/em>) found a method that enabled a computer to learn relations between words such as:<\/span><\/p>\n\n\n\n<pre class=\"wp-block-preformatted\"><span style=\"font-weight: 400;\">king<\/span><span style=\"font-weight: 400;\">-<\/span><span style=\"font-weight: 400;\">man<\/span><span style=\"font-weight: 400;\">+<\/span><span style=\"font-weight: 400;\">woman<\/span><span style=\"font-weight: 400;\">\u2248<\/span><span style=\"font-weight: 400;\">queen<\/span><span style=\"font-weight: 400;\">.<\/span><\/pre>\n\n\n\n<p><span class=\"s5\">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp;This method, called <i>word embeddings<\/i>, has a lot of promise; it might even be able to reveal hidden structures in the world we see. Consider one relation it <a href=\"http:\/\/byterot.blogspot.in\/2015\/06\/five-crazy-abstractions-my-deep-learning-word2doc-model-just-did-NLP-gensim.html\" target=\"_blank\" rel=\"noreferrer noopener\"><span class=\"s6\">discovered<\/span><\/a>:<\/span><\/p>\n\n\n\n<pre class=\"wp-block-preformatted\"><span style=\"font-weight: 400;\">president<\/span><span style=\"font-weight: 400;\">-<\/span><span style=\"font-weight: 400;\">power<\/span><span style=\"font-weight: 400;\">\u2248<\/span><span style=\"font-weight: 400;\">prime minister<\/span><\/pre>\n\n\n\n<p><span class=\"s5\">Admittedly, this might be one of those specious relations. <\/span><\/p>\n\n\n\n<p><span class=\"s5\"><span class=\"Apple-converted-space\">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <\/span>Joking aside, it&#8217;s worth studying word embeddings for at least two reasons. First, there are a lot of applications made possible by word embeddings. Second, we can learn from the way researchers approached the problem of deciphering natural language for machines.&nbsp;<\/span><\/p>\n\n\n\n<p><span class=\"s5\"><span class=\"Apple-converted-space\">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <\/span>In Part 1 of this article series, let\u2019s take a look at the first of these reasons.<\/span><\/p>\n\n\n\n<h4 class=\"wp-block-heading\"><strong><span class=\"s5\">Uses of Word Embeddings<\/span><\/strong><\/h4>\n\n\n\n<p><span class=\"s4\">There&#8217;s no obvious<\/span><span class=\"s5\"> way to usefully compare two words unless we already know what they mean. The goal of word-embedding algorithms is, therefore, to<b> embed words with meaning based on their similarity or relationship with other words.&nbsp;<\/b><\/span><\/p>\n\n\n\n<p><span class=\"s5\"><span class=\"Apple-converted-space\">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <\/span>In practice, words are embedded into a real vector space, which comes with notions of distance and angle. We hope that these notions extend to the embedded words in meaningful ways, quantifying relations or similarity between different words. And empirically, they actually do!<\/span><\/p>\n\n\n\n<p><span class=\"s5\"><span class=\"Apple-converted-space\">&nbsp;&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <\/span>For example, the Google algorithm I mentioned above discovered certain nouns are singular\/plural or have gender (Mikolov 2013abc):<\/span><\/p>\n\n\n\n<figure class=\"wp-block-image\"><img loading=\"lazy\" decoding=\"async\" width=\"1920\" height=\"981\" src=\"https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/relations-Copy.png\" alt=\"relations - Copy\" class=\"wp-image-2785\" srcset=\"https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/relations-Copy.png 1920w, https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/relations-Copy-400x204.png 400w, https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/relations-Copy-1200x613.png 1200w, https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/relations-Copy-768x392.png 768w, https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/relations-Copy-1536x785.png 1536w, https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/relations-Copy-380x194.png 380w, https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/relations-Copy-700x358.png 700w, https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/relations-Copy-380x194.png 420w\" sizes=\"(max-width: 1920px) 100vw, 1920px\" \/><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<p><span class=\"s5\"> They also found a country-capital relationship:<\/span><\/p>\n\n\n\n<figure class=\"wp-block-image\"><img loading=\"lazy\" decoding=\"async\" width=\"1346\" height=\"915\" src=\"https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/country-Copy.png\" alt=\"country - Copy\" class=\"wp-image-2784\" srcset=\"https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/country-Copy.png 1346w, https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/country-Copy-400x272.png 400w, https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/country-Copy-1200x816.png 1200w, https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/country-Copy-768x522.png 768w, https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/country-Copy-380x258.png 380w, https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/country-Copy-700x476.png 700w, https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/country-Copy-380x258.png 420w\" sizes=\"(max-width: 1346px) 100vw, 1346px\" \/><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<p><span class=\"s5\"><span class=\"Apple-converted-space\">&nbsp;&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <\/span>And as further evidence that a word&#8217;s meaning can be implied from its relationships with other words, they actually found that the learned structure for one language often correlated to that of another language, perhaps suggesting the possibility for <a href=\"https:\/\/en.wikipedia.org\/wiki\/Machine_translation\" target=\"_blank\" rel=\"noreferrer noopener\"><span class=\"s6\">machine translation<\/span><\/a> through word embeddings (Mikolov 2013c):<\/span><\/p>\n\n\n\n<figure class=\"wp-block-image\"><img loading=\"lazy\" decoding=\"async\" width=\"1375\" height=\"938\" src=\"https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/mt-Copy.png\" alt=\"mt - Copy\" class=\"wp-image-2783\" srcset=\"https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/mt-Copy.png 1375w, https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/mt-Copy-400x273.png 400w, https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/mt-Copy-1200x819.png 1200w, https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/mt-Copy-768x524.png 768w, https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/mt-Copy-380x259.png 380w, https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/mt-Copy-700x478.png 700w, https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/mt-Copy-380x259.png 420w\" sizes=\"(max-width: 1375px) 100vw, 1375px\" \/><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<p><span class=\"s5\"><span class=\"Apple-converted-space\">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <\/span>They released their C code as the <a href=\"https:\/\/code.google.com\/p\/word2vec\" target=\"_blank\" rel=\"noreferrer noopener\"><span class=\"s6\">word2vec<\/span><\/a> package, and soon after, others adapted the algorithm for more programming languages. Notably, for <a href=\"https:\/\/radimrehurek.com\/gensim\/index.html\" target=\"_blank\" rel=\"noreferrer noopener\"><span class=\"s6\">gensim<\/span><\/a><\/span> <span class=\"s5\">(Python) and <a href=\"https:\/\/deeplearning4j.org\/word2vec\" target=\"_blank\" rel=\"noreferrer noopener\"><span class=\"s6\">deeplearning4j<\/span><\/a> (Java).<\/span><\/p>\n\n\n\n<p><span class=\"s5\">Today, many companies and <a href=\"https:\/\/www.springboard.com\/blog\/data-science\/what-does-a-data-scientist-do\/\" target=\"_blank\" data-type=\"post\" data-id=\"24427\" rel=\"noreferrer noopener\">data scientists<\/a><\/span> have found different ways to incorporate word2vec into their businesses and research. Spotify uses it to help provide music recommendations. Stitch Fix uses it to recommend clothing. Google is thought to use word2vec in RankBrain as part of its<span class=\"s5\"> search algorithm. <\/span><\/p>\n\n\n<div class=\"bg-leaf-50 p-4 my-3\"><h4 class=\"fw-bold text-center\">Get To Know Other\tData Science Students<\/h4><div class=\"row row-cols-1 row-cols-lg-3\"><div class=\"col\"><div class=\"card success-story-card h-100 d-flex justify-content-between mb-0\"><div class=\"flex-grow-1 text-center\"><a class=\"d-inline-block rounded-circle\" href=\"\/success\/hastings-reeves\" style=\"width:125px;height:125px;overflow:hidden\"><img decoding=\"async\" loading=\"lazy\" src=\"https:\/\/res.cloudinary.com\/springboard-images\/image\/upload\/v1648517255\/Student%20Success\/Hastings_Reeves_3.png\" alt=\"Hastings Reeves\" style=\"object-fit:contain;max-width:170px;height:125px\" \/><\/a><p class=\"fw-bold mb-0\">Hastings Reeves<\/p><p class=\"text-muted lh-1\">Business Intelligence Analyst at Velocity Global<\/p><\/div><div class=\"w-100 d-block d-md-none mt-3\"><\/div><p class=\"mb-0 mx-auto text-center\"><a class=\"btn btn-primary mx-auto\" href=\"\/success\/hastings-reeves\">Read Story<\/a><\/p><\/div><\/div><div class=\"col d-none d-md-block\"><div class=\"card success-story-card h-100 d-flex justify-content-between mb-0\"><div class=\"flex-grow-1 text-center\"><a class=\"d-inline-block rounded-circle\" href=\"\/success\/jonas-cuadrado\" style=\"width:125px;height:125px;overflow:hidden\"><img decoding=\"async\" loading=\"lazy\" src=\"https:\/\/res.cloudinary.com\/springboard-images\/image\/upload\/v1629203193\/Student%20Success\/Jonas_Cuadrado_125x125.png\" alt=\"Jonas Cuadrado\" style=\"object-fit:contain;max-width:170px;height:125px\" \/><\/a><p class=\"fw-bold mb-0\">Jonas Cuadrado<\/p><p class=\"text-muted lh-1\">Senior Data Scientist at Feedzai<\/p><\/div><p class=\"mb-0 mx-auto text-center\"><a class=\"btn btn-primary mx-auto\" href=\"\/success\/jonas-cuadrado\">Read Story<\/a><\/p><\/div><\/div><div class=\"col d-none d-md-block\"><div class=\"card success-story-card h-100 d-flex justify-content-between mb-0\"><div class=\"flex-grow-1 text-center\"><a class=\"d-inline-block rounded-circle\" href=\"\/success\/ginny-zhu\" style=\"width:125px;height:125px;overflow:hidden\"><img decoding=\"async\" loading=\"lazy\" src=\"https:\/\/res.cloudinary.com\/springboard-images\/image\/upload\/v1629203192\/Student%20Success\/Ginny_Zhu_125x125.png\" alt=\"Ginny Zhu\" style=\"object-fit:contain;max-width:170px;height:125px\" \/><\/a><p class=\"fw-bold mb-0\">Ginny Zhu<\/p><p class=\"text-muted lh-1\">Data Science Intern at Novartis<\/p><\/div><p class=\"mb-0 mx-auto text-center\"><a class=\"btn btn-primary mx-auto\" href=\"\/success\/ginny-zhu\">Read Story<\/a><\/p><\/div><\/div><\/div><\/div>\n\n\n\n<p><span class=\"s5\"><span class=\"Apple-converted-space\">\u00a0\u00a0 \u00a0 \u00a0 \u00a0 \u00a0 <\/span>Other researchers are using word2vec for sentiment analysis, which attempts to identify the emotionality behind the words people use to communicate. For example, one <a href=\"https:\/\/arxiv.org\/pdf\/1606.02820.pdf\" target=\"_blank\" rel=\"noreferrer noopener\"><span class=\"s6\">Stanford research group<\/span><\/a> looked at how the same words in different Reddit communities take on different connotations. Here\u2019s an example with the word <i>soft<\/i>:<\/span><\/p>\n\n\n\n<figure class=\"wp-block-image\"><img loading=\"lazy\" decoding=\"async\" width=\"762\" height=\"517\" src=\"https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/reddit-Copy.png\" alt=\"reddit - Copy\" class=\"wp-image-2786\" srcset=\"https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/reddit-Copy.png 762w, https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/reddit-Copy-400x271.png 400w, https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/reddit-Copy-380x258.png 380w, https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/reddit-Copy-700x475.png 700w, https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/reddit-Copy-380x258.png 420w\" sizes=\"(max-width: 762px) 100vw, 762px\" \/><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<p>As you can see, the word &#8220;soft&#8221; has a negative connotation when you&#8217;re talking about sports (you might think of the term &#8220;soft players&#8221;) while they have a positive connotation when you&#8217;re talking about cartoons.<\/p>\n\n\n\n<p><span class=\"s5\"> And here are more examples where the computer could analyze the emotional sentiment of the same words across different communities.<\/span><\/p>\n\n\n\n<figure class=\"wp-block-image\"><img loading=\"lazy\" decoding=\"async\" width=\"969\" height=\"432\" src=\"https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/reddit-spectrum-Copy.png\" alt=\"reddit-spectrum - Copy\" class=\"wp-image-2787\" srcset=\"https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/reddit-spectrum-Copy.png 969w, https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/reddit-spectrum-Copy-400x178.png 400w, https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/reddit-spectrum-Copy-768x342.png 768w, https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/reddit-spectrum-Copy-380x169.png 380w, https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/reddit-spectrum-Copy-700x312.png 700w, https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/reddit-spectrum-Copy-380x169.png 420w\" sizes=\"(max-width: 969px) 100vw, 969px\" \/><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<p><span class=\"s5\"><span class=\"Apple-converted-space\">&nbsp;&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <\/span>They can even apply the same method over time, following how the word <i>terrific<\/i>, which meant <i>horrific<\/i> for the majority of the 20th century, has come to essentially mean <i>great<\/i> today.<\/span><\/p>\n\n\n\n<figure class=\"wp-block-image\"><img loading=\"lazy\" decoding=\"async\" width=\"976\" height=\"564\" src=\"https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/terrific-Copy.png\" alt=\"terrific - Copy\" class=\"wp-image-2788\" srcset=\"https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/terrific-Copy.png 976w, https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/terrific-Copy-400x231.png 400w, https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/terrific-Copy-768x444.png 768w, https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/terrific-Copy-380x220.png 380w, https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/terrific-Copy-700x405.png 700w, https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/terrific-Copy-380x220.png 420w\" sizes=\"(max-width: 976px) 100vw, 976px\" \/><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<p><span class=\"s5\"><span class=\"Apple-converted-space\">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <\/span>As a light-hearted example, one <span class=\"s6\">research group<\/span><\/span> <span class=\"s5\">used word2vec to help them determine whether a fact is surprising or not, so that they could automatically generate trivia facts.<\/span><\/p>\n\n\n\n<p><span class=\"s5\"><span class=\"Apple-converted-space\">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <\/span>The successes of word2vec have also helped spur on other forms of word embedding\u2014<a href=\"https:\/\/arxiv.org\/pdf\/1506.02761.pdf\" target=\"_blank\" rel=\"noreferrer noopener\"><span class=\"s6\">WordRank<\/span><\/a>, Stanford&#8217;s <a href=\"https:\/\/nlp.stanford.edu\/projects\/glove\/\" target=\"_blank\" rel=\"noreferrer noopener\"><span class=\"s6\">GloVe<\/span><\/a>, and Facebook&#8217;s <a href=\"https:\/\/research.fb.com\/projects\/fasttext\/\" target=\"_blank\" rel=\"noreferrer noopener\"><span class=\"s6\">fastText<\/span><\/a>, to name a few major ones. <\/span><\/p>\n\n\n\n<p><span class=\"s5\">These algorithms seek to improve on word2vec &#8212; they also look at texts through different <i>units<\/i>: characters, subwords, words, phrases, sentences, documents, and perhaps even units of thought. As a result, they allow us to think about not just word similarity, but also sentence similarity and<span class=\"Apple-converted-space\">&nbsp; <\/span>document similarity&#8212;like this paper did (Kusner 2015):<\/span><\/p>\n\n\n\n<figure class=\"wp-block-image\"><img loading=\"lazy\" decoding=\"async\" width=\"966\" height=\"464\" src=\"https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/wmd-Copy.png\" alt=\"wmd - Copy\" class=\"wp-image-2789\" srcset=\"https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/wmd-Copy.png 966w, https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/wmd-Copy-400x192.png 400w, https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/wmd-Copy-768x369.png 768w, https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/wmd-Copy-380x183.png 380w, https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/wmd-Copy-700x336.png 700w, https:\/\/www.springboard.com\/blog\/wp-content\/uploads\/2017\/08\/wmd-Copy-380x183.png 420w\" sizes=\"(max-width: 966px) 100vw, 966px\" \/><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<p><span class=\"s5\"><span class=\"Apple-converted-space\">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <\/span>Word embeddings <b>transform human language meaningfully into a form conducive to numerical analysis<\/b>. In doing so, they allow computers to explore the wealth of knowledge encoded implicitly into our own ways of speaking. <strong>We&#8217;ve barely scratched the surface of that potential<\/strong>.<\/span><\/p>\n\n\n\n<p><span class=\"s5\"><span class=\"Apple-converted-space\">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; A<\/span>ny individual programmer or scholar can use these tools and contribute new knowledge. Many areas of research and industry that could benefit from NLP have yet to be explored. Word embeddings and <a href=\"https:\/\/www.springboard.com\/blog\/data-science\/beginners-guide-neural-network-in-python-scikit-learn-0-18\/\" target=\"_blank\" data-type=\"URL\" data-id=\"https:\/\/www.springboard.com\/blog\/data-science\/beginners-guide-neural-network-in-python-scikit-learn-0-18\/\" rel=\"noreferrer noopener\">neural language models<\/a> are powerful techniques. But perhaps the most powerful aspect of machine learning is its collaborative culture. Many, if not most, of the state-of-the-art methods, are open-source, along with their accompanying research.<\/span><\/p>\n\n\n\n<p><span class=\"s5\"><span class=\"Apple-converted-space\">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <\/span>So, it&#8217;s there, if we want to take advantage. Now, the main obstacle is just ourselves. And&#8230;maybe an expensive GPU.<\/span><\/p>\n\n\n\n<p><span class=\"s8\">\/\/<\/span><\/p>\n\n\n\n<p><span class=\"s3\">For the theory behind word embeddings, see<\/span> <a href=\"http:\/\/www.learndatasci.com\/intro-to-word-embeddings-problems-theory\/\" target=\"_blank\" rel=\"noopener\"><span class=\"s9\">Part 2<\/span><\/a><span class=\"s5\">.<\/span><\/p>\n\n\n\n<p><span class=\"s5\">References<\/span><\/p>\n\n\n\n<p><span class=\"s3\">[<a href=\"https:\/\/arxiv.org\/pdf\/1606.02820.pdf\" target=\"_blank\" rel=\"noreferrer noopener\"><span class=\"s10\">Hamilton 2016<\/span><\/a>] Hamilton, William L., et al. \u201cInducing domain-specific sentiment lexicons from unlabeled corpora.\u201d arXiv preprint arXiv:1606.02820 (2016).<\/span><\/p>\n\n\n\n<p><span class=\"s3\">[<a href=\"http:\/\/proceedings.mlr.press\/v37\/kusnerb15.pdf\" target=\"_blank\" rel=\"noreferrer noopener\"><span class=\"s10\">Kusner 2015<\/span><\/a>] Kusner, Matt, et al. \u201cFrom word embeddings to document distances.\u201d International Conference on Machine Learning. 2015.<\/span><\/p>\n\n\n\n<p><span class=\"s3\">[<a href=\"http:\/\/www.aclweb.org\/anthology\/N13-1090\" target=\"_blank\" rel=\"noreferrer noopener\"><span class=\"s10\">Mikolov 2013a<\/span><\/a>] Mikolov, Tomas, Wen-tau Yih, and Geoffrey Zweig. \u201cLinguistic regularities in continuous space word representations.\u201d hlt-Naacl. Vol. 13. 2013.<\/span><\/p>\n\n\n\n<p><span class=\"s3\">[<a href=\"https:\/\/arxiv.org\/pdf\/1301.3781.pdf\" target=\"_blank\" rel=\"noreferrer noopener\"><span class=\"s10\">Mikolov 2013b<\/span><\/a>] Mikolov, Tomas, et al. \u201cEfficient estimation of word representations in vector space.\u201d arXiv preprint arXiv:1301.3781 (2013).<\/span><\/p>\n\n\n\n<p><span class=\"s3\">[<a href=\"https:\/\/arxiv.org\/pdf\/1310.4546.pdf\" target=\"_blank\" rel=\"noreferrer noopener\"><span class=\"s10\">Mikolov 2013c<\/span><\/a>] Mikolov, Tomas, et al. \u201cDistributed representations of words and phrases and their compositionality.\u201d Advances in neural information processing systems. 2013.<\/span><\/p>\n\n\n\n<p><span class=\"s3\">[<a href=\"https:\/\/arxiv.org\/pdf\/1309.4168.pdf\" target=\"_blank\" rel=\"noreferrer noopener\"><span class=\"s10\">Mikolov 2013d<\/span><\/a>] Mikolov, Tomas, Quoc V. Le, and Ilya Sutskever. \u201cExploiting similarities among languages for machine translation.\u201d arXiv preprint arXiv:1309.4168 (2013).<\/span><\/p>\n\n\n\n<p class=\"rm has-background\" style=\"background-color:#efeff6\"><strong>Since you\u2019re here\u2026<br><\/strong>Curious about a career in data science? Experiment with our <a rel=\"noreferrer noopener\" href=\"https:\/\/www.springboard.com\/resources\/guides\/data-science-process\/\" target=\"_blank\">free data science learning path<\/a>, or join our <a rel=\"noreferrer noopener\" href=\"https:\/\/www.springboard.com\/courses\/data-science-career-track\/\" target=\"_blank\">Data Science Bootcamp<\/a>, where you\u2019ll get your tuition back if you don&#8217;t land a job after graduating. We\u2019re confident because our courses work \u2013 check out our <a rel=\"noreferrer noopener\" href=\"https:\/\/www.springboard.com\/success\/\" target=\"_blank\">student success stories<\/a> to get inspired.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Understanding Word Embeddings Part 1: Applications Written by Aaron Geelon So If you already have a solid understanding of word embeddings and are well into your data science career, skip ahead to the next part! Human language is unreasonably effective at describing how we relate to the world. With a few, short words, we can [&hellip;]<\/p>\n","protected":false},"author":10,"featured_media":2792,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"_eb_attr":"","_eb_data_table":"","footnotes":""},"categories":[67],"tags":[],"marketing_tags":[],"class_list":{"0":"post-2776","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-data-science"},"acf":[],"_links":{"self":[{"href":"https:\/\/www.springboard.com\/blog\/wp-json\/wp\/v2\/posts\/2776"}],"collection":[{"href":"https:\/\/www.springboard.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.springboard.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.springboard.com\/blog\/wp-json\/wp\/v2\/users\/10"}],"replies":[{"embeddable":true,"href":"https:\/\/www.springboard.com\/blog\/wp-json\/wp\/v2\/comments?post=2776"}],"version-history":[{"count":4,"href":"https:\/\/www.springboard.com\/blog\/wp-json\/wp\/v2\/posts\/2776\/revisions"}],"predecessor-version":[{"id":49277,"href":"https:\/\/www.springboard.com\/blog\/wp-json\/wp\/v2\/posts\/2776\/revisions\/49277"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.springboard.com\/blog\/wp-json\/wp\/v2\/media\/2792"}],"wp:attachment":[{"href":"https:\/\/www.springboard.com\/blog\/wp-json\/wp\/v2\/media?parent=2776"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.springboard.com\/blog\/wp-json\/wp\/v2\/categories?post=2776"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.springboard.com\/blog\/wp-json\/wp\/v2\/tags?post=2776"},{"taxonomy":"marketing_tags","embeddable":true,"href":"https:\/\/www.springboard.com\/blog\/wp-json\/wp\/v2\/marketing_tags?post=2776"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}