{"id":395,"date":"2010-12-27T12:11:24","date_gmt":"2010-12-27T16:11:24","guid":{"rendered":"http:\/\/www.openbible.info\/blog\/?p=395"},"modified":"2016-01-31T20:57:03","modified_gmt":"2016-02-01T00:57:03","slug":"quantifying-traditional-vs-contemporary-language-in-english-bibles-using-google-ngram-data","status":"publish","type":"post","link":"https:\/\/www.openbible.info\/blog\/2010\/12\/quantifying-traditional-vs-contemporary-language-in-english-bibles-using-google-ngram-data\/","title":{"rendered":"Quantifying Traditional vs. Contemporary Language in English Bibles Using Google NGram Data"},"content":{"rendered":"<p>Using data from Google&#8217;s new ngram corpus, here&#8217;s how English Bible translations compare in their use of traditional vs. contemporary vocabulary:<\/p>\n<p><img loading=\"lazy\" src=\"https:\/\/a.openbible.info\/blog\/2010-12-translations.png\" width=\"800\" height=\"600\" alt=\"Relative Traditional vs. Contemporary Language in English Bible Translations\" \/><br \/>\n* Partial Bible (New Testament except for The Voice, which only has the Gospel of John). The colors represent somewhat arbitrary groups.<\/p>\n<p>Here&#8217;s similar data with the most recent publication year (since 1970) as the x-axis:<\/p>\n<p><img loading=\"lazy\" src=\"https:\/\/a.openbible.info\/blog\/2010-12-translations-publication.png\" width=\"800\" height=\"660\" alt=\"Relative Traditional vs. Contemporary Language in English Bible Translations by Publication Year\" \/><\/p>\n<h3>Discussion<\/h3>\n<p>The result accords well with my expectations of translations. It generally follows the &#8220;word for word\/thought for thought&#8221; continuum often used to categorize translations, suggesting that word-for-word, functionally equivalent translations tend toward traditional language, while thought-for-thought, dynamic-equivalent translations sometimes find replacements for traditional words. For reference, here&#8217;s how Bible publisher Zondervan categorizes translations along that continuum:<\/p>\n<p><img loading=\"lazy\" src=\"https:\/\/a.openbible.info\/blog\/2010-12-translations-continuum.png\" width=\"800\" height=\"186\" alt=\"A word-for-word to thought-for-thought continuum lists about twenty English translations, from an interlinear to The Message.\" \/><\/p>\n<p>I&#8217;m not sure what to make of the curious NLT grouping in the first chart above: the five translations are more similar than any others. In particular, I&#8217;d expect the new <a href=\"http:\/\/www.commonenglishbible.com\/\">Common English Bible<\/a> to be more contemporary&#8211;perhaps it will become so once the Old Testament is available and it&#8217;s more comparable to other translations.<\/p>\n<p>In the chart with publication years, notice how no one tries to occupy the same space as the NIV for twenty years until the HCSB comes along.<\/p>\n<p>The World English Bible appears where it does largely because it uses &#8220;Yahweh&#8221; instead of &#8220;LORD.&#8221; If you ignore that word, the WEB shows up between the Amplified and the NASB. (The word <a href=\"http:\/\/ngrams.googlelabs.com\/graph?content=Yahweh&#038;year_start=1800&#038;year_end=2008&#038;corpus=0&#038;smoothing=3\">Yahweh<\/a> has become more popular recently.) Similarly, the New Jerusalem Bible would appear between the HCSB and the NET for the same reason.<\/p>\n<p>The more contemporary versions often use contractions (e.g., <a href=\"http:\/\/ngrams.googlelabs.com\/graph?content=you%27ll&#038;year_start=1800&#038;year_end=2008&#038;corpus=0&#038;smoothing=3\">you&#8217;ll<\/a>), which pulls their score considerably toward the contemporary side.<\/p>\n<p>Religious words (&#8220;God,&#8221; &#8220;Jesus&#8221;) pull translations to the traditional side, since a greater percentage of books in the past dealt with religious subjects. A religious text such as the Bible therefore naturally tends toward older language.<\/p>\n<p>If you&#8217;re looking for translations largely free from copyright restrictions, most of the KJV-grouped translations are public domain. The <a href=\"http:\/\/www.lexhamenglishbible.com\/\">Lexham English Bible<\/a> and the <a href=\"http:\/\/www.ebible.org\/web\/\">World English Bible<\/a> are available in the ESV\/NASB group. The <a href=\"http:\/\/net.bible.org\/\">NET Bible<\/a> is available in the NIV group. Interestingly, all the more contemporary-style translations are under standard copyright; I don&#8217;t know of a project to produce an open thought-for-thought translation&#8211;maybe because there&#8217;s more room for disagreement in such a project?<\/p>\n<p>Not included in the above chart is the <a href=\"http:\/\/www.lolcatbible.com\/\">LOLCat Bible<\/a>, a non-academic attempt to translate the Bible into <a href=\"https:\/\/en.wikipedia.org\/wiki\/Lolcat\">LOLspeak<\/a>. If charted, it appears well to the contemporary side of The Message:<\/p>\n<p><img loading=\"lazy\" src=\"https:\/\/a.openbible.info\/blog\/2010-12-translations-lol.png\" width=\"800\" height=\"41\" alt=\"The KJV is on the far left, The Message is in the middle, and the LOLCat Bible is on the far right.\" \/><\/p>\n<h3>Methodology<\/h3>\n<p>I downloaded the <a href=\"http:\/\/ngrams.googlelabs.com\/datasets\">English 1-gram corpus<\/a> from Google, normalized the words (stripping combining characters and making them case insensitive), and inserted the five million or so unique words into a database table. I combined individual years into decades to lower the row count. Next, I ran a percentage-wise comparison (similar to what <a href=\"http:\/\/ngrams.googlelabs.com\/\">Google&#8217;s ngram viewer<\/a> does) for each word to determine when they were most popular.<\/p>\n<p>Then, I created word counts for a variety of translations, dropped stopwords, and multiplied the counts by the above ngram percentages to arrive at a median year for each translation.<\/p>\n<p>The year scale (x-axis on the first chart, y-axis on the second) runs from 1838 to 1878, largely, as mentioned before, because Bibles use religious language. Even the LOLCat Bible dates to 1921 because it uses words (e.g., &#8220;ceiling cat&#8221;) that don&#8217;t particularly tie it to the present.<\/p>\n<h3>Caveats<\/h3>\n<p>The data doesn&#8217;t present a complete picture of a translation&#8217;s suitability for a particular audience or overall readability. For example, it doesn&#8217;t take into account word order (&#8220;fear not&#8221; vs. &#8220;do not fear&#8221;). (I wanted to use Google&#8217;s two- or three-gram data to see what differences they make, but as of this writing, Google hasn&#8217;t finished uploading them.)<\/p>\n<p>I work for Zondervan, which publishes the NIV family of Bibles, but the work here is my own and I don&#8217;t speak for them.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Using data from Google&#8217;s new ngram corpus, here&#8217;s how English Bible translations compare in their use of traditional vs. contemporary vocabulary: * Partial Bible (New Testament except for The Voice, which only has the Gospel of John). The colors represent somewhat arbitrary groups. Here&#8217;s similar data with the most recent publication year (since 1970) as [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[3,17,9],"tags":[],"_links":{"self":[{"href":"https:\/\/www.openbible.info\/blog\/wp-json\/wp\/v2\/posts\/395"}],"collection":[{"href":"https:\/\/www.openbible.info\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.openbible.info\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.openbible.info\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.openbible.info\/blog\/wp-json\/wp\/v2\/comments?post=395"}],"version-history":[{"count":38,"href":"https:\/\/www.openbible.info\/blog\/wp-json\/wp\/v2\/posts\/395\/revisions"}],"predecessor-version":[{"id":1140,"href":"https:\/\/www.openbible.info\/blog\/wp-json\/wp\/v2\/posts\/395\/revisions\/1140"}],"wp:attachment":[{"href":"https:\/\/www.openbible.info\/blog\/wp-json\/wp\/v2\/media?parent=395"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.openbible.info\/blog\/wp-json\/wp\/v2\/categories?post=395"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.openbible.info\/blog\/wp-json\/wp\/v2\/tags?post=395"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}