<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>OpenBible.info Blog &#187; Visualizations</title>
	<atom:link href="http://www.openbible.info/blog/category/visualizations/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.openbible.info/blog</link>
	<description></description>
	<lastBuildDate>Wed, 08 Feb 2012 01:35:46 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>Re-visualizing Cross References (Interactively)</title>
		<link>http://www.openbible.info/blog/2012/02/re-visualizing-cross-references-interactively/</link>
		<comments>http://www.openbible.info/blog/2012/02/re-visualizing-cross-references-interactively/#comments</comments>
		<pubDate>Wed, 08 Feb 2012 01:35:46 +0000</pubDate>
		<dc:creator>openbible</dc:creator>
				<category><![CDATA[Cross References]]></category>
		<category><![CDATA[Visualizations]]></category>

		<guid isPermaLink="false">http://www.openbible.info/blog/?p=569</guid>
		<description><![CDATA[Browse this grid interactively. This visualization is arranged by book, showing cross-reference sources on the y-axis and targets on the x-axis. Within each square, the first verse in the book or section is at the top, and the last verse is at the bottom. Here&#8217;s what a detail of a square looks like: Genesis 1 [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.openbible.info/labs/cross-references/visualization"><img src="http://a.openbible.info/labs/cross-references/grid-800.jpg" width="800" height="742" alt="Visit an interactive visualization of Bible cross references." /></a><br />
<a href="http://www.openbible.info/labs/cross-references/visualization">Browse this grid interactively</a>.</p>
<p>This visualization is arranged by book, showing cross-reference sources on the y-axis and targets on the x-axis. Within each square, the first verse in the book or section is at the top, and the last verse is at the bottom. Here&#8217;s what a detail of a square looks like:</p>
<p><a href="http://a.openbible.info/labs/cross-references/books/Gen-Dan.png"><img src="http://a.openbible.info/labs/cross-references/books/Gen-Dan.png" width="800" alt="Cross references between Genesis and Daniel" /></a></p>
<p>Genesis 1 is at the top left; Genesis 50 is at the bottom left. Daniel 1 is at the top right; Daniel 12 is at the bottom right. The most-striking cross references between these two books, to me, involve Joseph&#8217;s interpretation of dreams in Genesis 40-41 and similar stories in Daniel.</p>
<p>Also see a previous <a href="http://www.openbible.info/blog/2010/04/bible-cross-references-visualization/">cross reference visualization</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.openbible.info/blog/2012/02/re-visualizing-cross-references-interactively/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Applying Sentiment Analysis to the Bible</title>
		<link>http://www.openbible.info/blog/2011/10/applying-sentiment-analysis-to-the-bible/</link>
		<comments>http://www.openbible.info/blog/2011/10/applying-sentiment-analysis-to-the-bible/#comments</comments>
		<pubDate>Mon, 10 Oct 2011 16:43:02 +0000</pubDate>
		<dc:creator>openbible</dc:creator>
				<category><![CDATA[Sentiment]]></category>
		<category><![CDATA[Visualizations]]></category>

		<guid isPermaLink="false">http://www.openbible.info/blog/?p=517</guid>
		<description><![CDATA[This visualization explores the ups and downs of the Bible narrative, using sentiment analysis to quantify when positive and negative events are happening: Full size download (.png, 4000&#215;4000 pixels). Things start off well with creation, turn negative with Job and the patriarchs, improve again with Moses, dip with the period of the judges, recover with [...]]]></description>
			<content:encoded><![CDATA[<p>This visualization explores the ups and downs of the Bible narrative, using sentiment analysis to quantify when positive and negative events are happening:</p>
<p><a href="http://a.openbible.info/blog/2011-10-sentiment-big.png"><img src="http://a.openbible.info/blog/2011-10-sentiment.png" width="800" height="800" alt ="Sentiment analysis of the Bible." /></a><br />
<a href="http://a.openbible.info/blog/2011-10-sentiment-full.png">Full size download</a> (.png, 4000&#215;4000 pixels).</p>
<p>Things start off well with creation, turn negative with Job and the patriarchs, improve again with Moses, dip with the period of the judges, recover with David, and have a mixed record (especially negative when Samaria is around) during the monarchy. The exilic period isn’t as negative as you might expect, nor the return period as positive. In the New Testament, things start off fine with Jesus, then quickly turn negative as opposition to his message grows. The story of the early church, especially in the epistles, is largely positive.</p>
<h3>Methodology</h3>
<p>Sentiment analysis involves algorithmically determining if a piece of text is positive (“I like cheese”) or negative (“I hate cheese”). Think of it as <a href="http://kottke.org/11/09/kurt-vonnegut-explains-the-shapes-of-stories">Kurt Vonnegut&#8217;s story shapes</a> backed by quantitative data.</p>
<p>I ran the <a href="http://viralheat.com/developer/sentiment_api">Viralheat Sentiment API</a> over several Bible translations to produce a composite sentiment average for each verse. Strictly speaking, the Viralheat API only returns a probability that the given text is positive or negative, not the intensity of the sentiment. For this purpose, however, probability works as a decent proxy for intensity.</p>
<p>The visualization takes a moving average of the data to provide a coherent story; the raw data is more jittery. <a href="http://a.openbible.info/blog/2011-10-sentiment-data.zip">Download the raw data</a> (400 KB .zip).</p>
<h3>Update October 10, 2011</h3>
<p>As requested in the comments, here&#8217;s the data arranged by book with a moving average of five verses on either side. (By comparison, the above visualization uses a moving average of 150 verses on either side.)</p>
<p><a href="http://a.openbible.info/blog/2011-10-sentiment-book-big.png"><img src="http://a.openbible.info/blog/2011-10-sentiment-book.png" width="800" height="1194" alt ="Sentiment analysis of the Bible, arranged by book." /></a><br />
<a href="http://a.openbible.info/blog/2011-10-sentiment-book-full.png">Full size download</a> (.png, 2680&#215;4000 pixels).</p>
<p>Update December 28, 2011: Christianity Today includes this visualization in their December issue (<a href="http://www.christianitytoday.com/ct/2011/december/spotlight-dec11.html">&#8220;How the Bible Feels&#8221;</a>).</p>
]]></content:encoded>
			<wfw:commentRss>http://www.openbible.info/blog/2011/10/applying-sentiment-analysis-to-the-bible/feed/</wfw:commentRss>
		<slash:comments>41</slash:comments>
		</item>
		<item>
		<title>Bible Annotation Modeling and Querying in MySQL and CouchDB</title>
		<link>http://www.openbible.info/blog/2011/09/bible-annotation-modeling-and-querying-in-mysql-and-couchdb/</link>
		<comments>http://www.openbible.info/blog/2011/09/bible-annotation-modeling-and-querying-in-mysql-and-couchdb/#comments</comments>
		<pubDate>Thu, 01 Sep 2011 12:03:55 +0000</pubDate>
		<dc:creator>openbible</dc:creator>
				<category><![CDATA[Data Modeling]]></category>
		<category><![CDATA[Twitter]]></category>
		<category><![CDATA[Visualizations]]></category>

		<guid isPermaLink="false">http://www.openbible.info/blog/?p=464</guid>
		<description><![CDATA[If you’re storing people’s Bible annotations (notes, bookmarks, highlights, etc.) digitally, you want to be able to retrieve them later. Let’s look at some strategies for how to store and look up these annotations. Know What You’re Modeling First you need to understand the shape of the data. I don’t have access to a large [...]]]></description>
			<content:encoded><![CDATA[<p>If you’re storing people’s Bible annotations (notes, bookmarks, highlights, etc.) digitally, you want to be able to retrieve them later. Let’s look at some strategies for how to store and look up these annotations.</p>
<h3>Know What You’re Modeling</h3>
<p>First you need to understand the shape of the data. I don’t have access to a large repository of Bible annotations, but the Twitter and Facebook Bible citations from the <a href="http://www.openbible.info/realtime/">Realtime Bible Search</a> section of this website provide a good approximation of how people cite the Bible. (Quite a few Facebook posts appear to involve people responding to their daily devotions.) These tweets and posts are public, and private annotations may take on a slightly different form, but the general shape of the data should be similar: nearly all (99%) refer to a chapter or less.</p>
<p><a href="http://a.openbible.info/blog/2011-08-social-full.png"><img src="http://a.openbible.info/blog/2011-08-social.png" width="600" height="439" alt="Large dots at the bottom indicate many single-verse references. Chapter references are also fairly prominent. See below for more discussion." /></a></p>
<p><a href="http://www.biblegateway.com/blog/2011/08/how-people-share-the-bible-verses-vs-read-the-bible-chapters/">Compare Bible Gateway reading habits</a>, which are much heavier on chapter-level usage, but 98% of accesses still involve a chapter or less.</p>
<h3>The Numbers</h3>
<p>The data consists of about 35 million total references.</p>
<table class="data">
<tr>
<th class="number">Percent of Total</th>
<th>Description</th>
<th>Example</th>
</tr>
<tr>
<td class="number">73.5</td>
<td>Single verse</td>
<td>John 3:16</td>
</tr>
<tr>
<td class="number">17.1</td>
<td>Verse range in a single chapter</td>
<td>John 3:16-17</td>
</tr>
<tr>
<td class="number">8.4</td>
<td>Exactly one chapter</td>
<td>John 3</td>
</tr>
<tr>
<td class="number">0.7</td>
<td>Two or more chapters (at chapter boundaries)</td>
<td>John 3-4</td>
</tr>
<tr>
<td class="number">0.1</td>
<td>Verses spanning two chapters (not at chapter boundaries)</td>
<td>John 3:16-4:2</td>
</tr>
<tr>
<td class="number">0.1</td>
<td>Verses spanning three or more chapters (not at chapter boundaries)</td>
<td>John 3:16-5:2</td>
</tr>
</table>
<p>About 92.9% of posts or tweets cited only one verse or verse range; 7.1% mentioned more than one verse range. Of the latter, 77% cited exactly two verse ranges; the highest had 323 independent verse ranges. Of Facebook posts, 9.1% contained multiple verse ranges, compared to 4.2% of tweets. When there were multiple ranges, 43% of the time they referred to verses in different books from the other ranges; 39% referred to verses in the same book (but not in the same chapter); and 18% referred to verses in the same chapter. (This distribution is a unusual—normally close verses stick together.)</p>
<p>The data, oddly, doesn’t contain any references that span multiple books. Less than 0.01% of passage accesses span multiple books on Bible Gateway, which is probably a useful upper bound for this type of data.</p>
<h4>Key Points</h4>
<ol>
<li>Nearly all citations involve verses in the same chapter; only 1% involve verses in multiple chapters.</li>
<li>Of the 1% spanning two or more chapters, most refer to exact chapter boundaries.</li>
<li>Multiple-book references are even more unusual (under 0.01%) but have outsize effects: an annotation that references Genesis 1 to Revelation 22 would be relevant for every verse in the Bible.</li>
<li>Around 7% of notes contained multiple independent ranges of verses—the more text you allow for an annotation, the more likely someone is to mention multiple verses.</li>
</ol>
<h4>Download</h4>
<p><a href="http://a.openbible.info/blog/2011-08-social-lengths.zip">Download the raw social data</a> (1.4 MB zip) under the usual CC-Attribution license.</p>
<h3>Data Modeling</h3>
<p>A Bible annotation consists of arbitrary content (a highlight might have one kind of content, while a proper note might have a title, body, attachments, etc., but modeling the content itself isn&#8217;t the point of this piece) tied to one or more Bible references:</p>
<ol>
<li>A single verse (John 3:16).</li>
<li>A single range (John 3:16-17).</li>
<li>Multiple verses or ranges (John 3:16, John 3:18-19)</li>
</ol>
<h3>The Relational Model</h3>
<p>One user can have many rows of annotations, and one annotation can have many rows of verses that it refers to. To model a Bible annotation relationally, we set up three tables that look something like this:</p>
<h4>users</h4>
<table class="data">
<tr>
<th>user_id</th>
<th>name</th>
</tr>
<tr>
<td>1</td>
<td>…</td>
</tr>
</table>
<h4>annotations</h4>
<table class="data">
<tr>
<th>user_id</th>
<th>annotation_id</th>
<th>content</th>
</tr>
<tr>
<td>1</td>
<td>101</td>
<td>…</td>
</tr>
<tr>
<td>1</td>
<td>102</td>
<td>…</td>
</tr>
<tr>
<td>1</td>
<td>103</td>
<td>…</td>
</tr>
</table>
<h4>annotation_verses</h4>
<p> The verse references here are integers to allow for easy range searches: 43 = John (the 43rd book in the typical Protestant Bible); 003 = the third chapter; the last three digits = the verse number.</p>
<p>I like using this approach over others (sequential integer or separate columns for book, chapter, and verse) because it limits the need for a lookup table. (You just need to know that 43 = John, and then you can find any verse or range of verses in that book.) It also lets you find all the annotations for a particular chapter without having to know how many verses are in the chapter. (The longest chapter in the Bible has 176 verses, so you know that all the verses in John 3, for example, fall between 43003001 and 43003176.) This main disadvantage is that you don’t necessarily know how many verses you’re selecting until after you’ve selected them. And using individual columns, unlike here, does allow you to run <code>group by</code> queries to get easy counts.</p>
<table class="data">
<tr>
<th>annotation_id</th>
<th>start_verse</th>
<th>end_verse</th>
</tr>
<tr>
<td>101</td>
<td>43003016</td>
<td>43003016</td>
</tr>
<tr>
<td>102</td>
<td>43003016</td>
<td>43003017</td>
</tr>
<tr>
<td>103</td>
<td>43003016</td>
<td>43003016</td>
</tr>
<tr>
<td>103</td>
<td>43003019</td>
<td>43003020</td>
</tr>
</table>
<h3>Querying</h3>
<p>In a Bible application, the usual mode of accessing annotations is by passage: if you’re looking at John 3:16-18, you want to see all your annotations that apply to that passage.</p>
<h3>Querying MySQL</h3>
<p>In SQL terms:</p>
<p><code>select distinct(annotations.annotation_id)<br />
from annotations, annotation_verses<br />
where annotation_verses.start_verse &lt;= 43003018 and<br />
annotation_verses.end_verse &gt;= 43003016 and<br />
annotations.user_id = 1 and<br />
annotations.annotation_id = annotation_verses.annotation_id<br />
order by annotation_verses.start_verse asc, annotation_verses.end_verse desc</code></p>
<p>The quirkiest part of the SQL is the first part of the “where” clause, which at first glance looks backward: why is the last verse in the <code>start_verse</code> field and the first verse in the <code>end_verse</code> field? Because the <code>start_verse</code> and <code>end_verse</code> can span any range of verses, you need to make sure that you get any range that overlaps the verses you’re looking for: in other words, the <code>start_verse</code> is before the end of the range, and the <code>end_verse</code> is after the start.</p>
<p>Visually, you can think of each <code>start_verse</code> and <code>end_verse</code> pair as a line: if the line overlaps the shaded area you’re looking for, then it’s a relevant annotation. If not, it’s not relevant. There are six cases:</p>
<p><img src="http://a.openbible.info/blog/2011-08-before-after.png" width="516" height="277" alt="Start before, end before: John 3:15 / Start before, end inside: John 3:15-17 / Start before, end after: John 3:15-19 / Start inside, end inside: John 3:16-18 / Start inside, end after: John 3:17-19 / Start after, end after: John 3:19" /></p>
<p>The other trick in the SQL is the sort order: you generally want to see annotations in canonical order, starting with the longest range first. In other words, you start with an annotation about John 3, then to a section inside John 3, then to individual verses. In this way, you move from the broadest annotations to the narrowest annotations. You may want to switch up this order, but it makes a good default.</p>
<p>The relational approach works pretty well. If you worry about the performance implications of the SQL join, you can always put the <code>user_id</code> in <code>annotation_verses</code> or use a view or something.</p>
<h3>Querying CouchDB</h3>
<p><a href="http://couchdb.apache.org/">CouchDB</a> is one of the oldest entrants in the NoSQL space and distinguishes itself by being both a key-value store and queryable using map-reduce:  the usual way to access more than one document in a single query is to write Javascript to output the data you want. It lets you create complex keys to query by, so you might think that you can generate a key like <code>[start_verse,end_verse]</code> and query it like this: <code>?startkey=[0,43003016]&amp;endkey=[43003018,99999999]</code></p>
<p>But no. Views are one-dimensional, meaning that CouchDB doesn’t even look at the second element in the key if the first one matches the query. For example, an annotation with both a start and end verse of <code>19001001</code> matches the above query, which isn’t useful for this purpose.</p>
<p>I can think of two ways to get around this limitation, both of which have drawbacks.</p>
<h4>GeoCouch</h4>
<p>CouchDB has a plugin called GeoCouch that lets you query geographic data, which actually maps well to this data model. (I didn’t come up with this approach on my own: see <a href="http://www.diretto.org/2010/08/efficient-time-based-range-queries-in-couchdb-using-geocouch/">Efficient Time-based Range Queries in CouchDB using GeoCouch</a> for the background.)</p>
<p>The basic idea is to treat each <code>start_verse,end_verse</code> pair as a point on a two-dimensional grid. Here’s the above social data plotted this way:</p>
<p><img src="http://a.openbible.info/blog/2011-08-social-grid.png" width="600" height="551" alt="A diagonal line starts in the bottom left corner and continues to the top right. Large dots indicate popular verses, and book outlines are visible." /></p>
<p>The line bisects the grid diagonally since an <code>end_verse</code> never precedes a <code>start_verse</code>: the diagonal line where <code>start_verse = end_verse</code> indicates the lower bound of any reference. Here are some points indicating where ranges fall on the plot:</p>
<p><img src="http://a.openbible.info/blog/2011-08-social-grid-points.png" width="600" height="554" alt="This chart looks the same as the previous one but has points marked to illustrate that longer ranges are farther away from the bisecting line." /></p>
<p>To find all the annotations relevant to John 3:16-18, we draw a region starting in the upper left and continuing to the point <code>43003018,43003016</code>:</p>
<p><img src="http://a.openbible.info/blog/2011-08-social-grid-bbox.png" width="600" height="551" alt="This chart looks the same as the previous one but has a box from the top left ending just above and past the beginning of John near the upper right of the chart." /></p>
<p>GeoCouch allows exactly this kind of bounding-box query: <code>?bbox=0,43003016,43003018,99999999</code></p>
<p>You can even support multiple users in this scheme: just give everyone their own, independent box. I might occupy 1&#215;1 (with an annotation at <code>1.43003016,1.43003016</code>), while you might occupy 2&#215;2 (with an annotation at <code>2.43003016,2.43003016</code>); queries for our annotations would never overlap. Each whole number to the left of the decimal acts as a namespace.</p>
<p>The drawbacks:</p>
<ol>
<li>The results aren’t sorted in a useful way. You’ll need to do sorting on the client side or in a <a href="http://guide.couchdb.org/editions/1/en/show.html">show function</a>.</li>
<li>You don’t get pagination.</li>
</ol>
<h4>Repetition at Intervals</h4>
<p>Given the shape of the data, which is overwhelmingly chapter-bound (and lookups, which at least on Bible Gateway are chapter-based), you could simply repeat chapter-spanning annotations at the beginning of every chapter. In the worst case annotation (Genesis 1-Revelation 22), you end up with about 1200 repetitions.</p>
<p>For example, in the Genesis-Revelation case, for John 3 you might create a key like <code>[43000000.01001001,66022021]</code> so that it sorts at the beginning of the chapter—and if you have multiple annotations with different start verses, they stay sorted properly.</p>
<p>To get annotations for John 3:16-18, you’d query for <code>?startkey=[43003000]&amp;endkey=[43003018,{}]</code></p>
<p>The drawbacks:</p>
<ol>
<li>You have to filter out all the irrelevant annotations: if you have a lot of annotations about John 3:14, you have to skip through them all before you get to the ones about John 3:16.</li>
<li>You have to filter out duplicates when the range you’re querying for spans multiple chapters.</li>
<li>You’re repeating yourself, though given how rarely a multi-chapter span (let alone a multi-book span) happens in the wild, it might not matter that much.</li>
</ol>
<h4>Other CouchDB Approaches</h4>
<p>Both these approaches assume that you want to make only one query to retrieve the data. If you’re willing to make multiple queries, you could create different list functions and query them in parallel: for example, you could have one for single-chapter annotations and one for multi-chapter annotations. See <a href="http://en.wikipedia.org/wiki/Interval_tree">interval trees</a> and <a href="http://en.wikipedia.org/wiki/Geohash">geohashes</a> for additional ideas. You could also introduce a separate query layer, such as <a href="http://www.elasticsearch.org/">elasticsearch</a>, to sit <a href="http://www.elasticsearch.org/guide/reference/river/couchdb.html">on top of CouchDB</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.openbible.info/blog/2011/09/bible-annotation-modeling-and-querying-in-mysql-and-couchdb/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Holy Week Timeline: Behind the Music</title>
		<link>http://www.openbible.info/blog/2011/04/holy-week-timeline-behind-the-music/</link>
		<comments>http://www.openbible.info/blog/2011/04/holy-week-timeline-behind-the-music/#comments</comments>
		<pubDate>Sun, 17 Apr 2011 01:24:49 +0000</pubDate>
		<dc:creator>openbible</dc:creator>
				<category><![CDATA[Visualizations]]></category>

		<guid isPermaLink="false">http://www.openbible.info/blog/?p=444</guid>
		<description><![CDATA[It’s always fun for me to learn the process people use to create visualizations, and especially why they made the decisions they did. So please forgive me if you find this post self-indulgent; I’m going to talk about the new Holy Week Timeline that’s on the Bible Gateway blog: The idea for this visualization started [...]]]></description>
			<content:encoded><![CDATA[<p>It’s always fun for me to learn the process people use to create visualizations, and especially why they made the decisions they did. So please forgive me if you find this post self-indulgent; I’m going to talk about the new <a href="http://www.biblegateway.com/blog/2011/04/holy-week-timeline-visualization/">Holy Week Timeline</a> that’s on the Bible Gateway blog:</p>
<p><a href="http://www.biblegateway.com/blog/2011/04/holy-week-timeline-visualization/"><img src="http://a.openbible.info/blog/2011-04-holy-week-final.png" width="799" height="223" alt="Holy Week timeline" /></a></p>
<p>The idea for this visualization started in November 2009 when xkcd published its <a href="http://xkcd.com/657/">movie narrative charts</a> comic, which bubbled up through the Internet and shortly thereafter <a href="http://www.google.com/insights/search/#q=movie%20narrative%20charts&#038;cmpt=q">became a meme</a>. Although the charts are really just setting up a joke for the last two panels in the comic, they’re also a fantastic way of visualizing narratives, providing a quick way to see what’s going on in a story at any point in time. The format also forces you to consider what’s happening offstage—it’s not like the other characters cease to exist just because you’re not seeing them and hearing about them.</p>
<p>My first thought was to plot the book of Acts this way, but Acts presented too broad a scope to manage in a reasonable timeframe. Holy Week then came to mind—it involves a limited amount of time and space, it doesn’t feature too many characters, and the Gospels recount it in a good bit of detail: one Gospel often fills in gaps in another’s account.</p>
<p>Now I needed data. (Data is always the holdup in creating visualizations.) Fortunately, Gospel harmonies are prevalent, even free ones online. The version of <a href="http://www.logos.com/">Logos</a> I have includes <a href="http://en.wikipedia.org/wiki/Archibald_Thomas_Robertson">A. T. Robertson</a>’s <i>Harmony of the Gospels</i>, so I started transcribing verse references from the pericopes listed there into a spreadsheet, identifying who’s in each one and when and where it takes place. I plowed halfway through, but then other priorities arose, and I had to abandon hopes of completing it in time for Holy Week 2010.</p>
<p>It lay dormant for a year (there’s not a lot of reason to publish something on Holy Week unless Holy Week is nigh). A few weeks ago, I finished itemizing the people, places, and times in Robertson. Justin Taylor last year published a <a href="http://thegospelcoalition.org/blogs/justintaylor/category/holy-week/">harmony of Holy Week</a> based on the ESV Study Bible, which had a slightly different take on the timeline (one that made more sense to me in certain areas), so I moved a few things around on my spreadsheet. I also consulted a couple of other study Bibles and books I had readily available to me.</p>
<p>With data in hand, it was time to put pencil to paper.</p>
<h3>Version 1: Paper</h3>
<p><img src="http://a.openbible.info/blog/2011-04-holy-week-hand.jpg" width="779" height="425" alt="Hand-drawn prototype" /></p>
<p>I wanted to make four basic changes to the xkcd comic: use the vertical axis consistently to show spatial progression, provide close-ups for complex narrative sequences, include every character and event, and add the days of the week to orient the viewer in time. Only the last of these changes wound up in the final product, however.</p>
<p>The vertical axis in this version proceeded from Bethany at the top, through the Mount of Olives and various places in Jerusalem, and ended at Emmaus. On a <a href="http://www.openbible.info/blog/2007/04/passion-week-in-google-earth/">map of Holy Week events</a>, this axis approximates a line running from east (Bethany) to west (Emmaus). Using the vertical axis this way encodes more information into the chart, allowing you to see everything that happened in a single location simply by following an imaginary horizontal line across the chart. Unfortunately, it also leads to a lopsided chart that progresses down and to the right, creating huge amounts of whitespace on a rectangular canvas. I didn’t see that problem yet, however.</p>
<p>I did see that the right half of the chart (Friday to Sunday) was much denser than the left half—I’d need to space that out better when creating a digital version.</p>
<h3>Version 2: Drawing Freehand in Illustrator</h3>
<p><img src="http://a.openbible.info/blog/2011-04-holy-week-freehand.png" width="800" height="215" alt="Mouse-drawn prototype" /></p>
<p>I have a confession: I&#8217;d never used Adobe Illustrator before this project. Most of my image work uses pixels; Photoshop is my constant companion. But this project would have to use vectors to allow for the constant fiddling necessary to produce a decent result at multiple sizes. So, Illustrator it was.</p>
<p>My first goal was to reproduce the pencil drawing with reasonable fidelity. I used my mouse to draw deliberately wobbly lines that mimicked the xkcd comic. Now, if I’d had more experience with Illustrator, the hand-drawn effect may have worked. But making changes was incredibly annoying; I had to delete sections, redraw them, and then join them to the existing lines. It took forever to make minor tweaks; what would I do when I needed to move whole lines around (as frequently happened later in the process)? After all, if you look closely, you’ll see entire swaths of the chart misplaced. (Why are the disciples hanging out in the Temple after Jesus’ arrest?) No, this hand-drawn approach was impractical for someone of my limited Illustrator experience. I needed straight lines and a grid.</p>
<h3>Version 3: The Grid</h3>
<p><img src="http://a.openbible.info/blog/2011-04-holy-week-grid.png" width="800" height="258" alt="Grooving with a 1970s grid style" /></p>
<p>My wife says that this version reminds her of 1970s-style album covers. She’s right. Nevertheless, it formed the foundation of the final product.</p>
<p>So, what are the problems here? First, the lines weigh too much. Having given up a pure freehand approach, I wanted a more <a href="http://en.wikipedia.org/wiki/Transit_map">transit-style map</a> (used for subways / the Underground) with straight lines and restricted angles. I’m most familiar with Chicago’s <a href="http://www.transitchicago.com/assets/1/maps/ctatrainmap.png">CTA map</a> and thought I’d emulate their style of thick lines that almost touch. This approach leads to lots of heavy lines that don’t convey additional information—it’s also tricky to round the corners of such thick lines without unsightly gaps appearing (again, for someone of my limited Illustrator experience).</p>
<p>The second problem is the extreme weight in the upper left of the chart, far out of proportion to the gravity of events there. The green, brown, and black lines represent Peter, James, and Judas, who don’t play prominent roles until later in the story. They’re adding lots of “ink” to the chart and not conveying useful information. They had to go.</p>
<p>Why not simply lighten the colors&#8211;after all, why is Judas’s line black? Simple: black often represents evil. Similarly, Jesus’ line is red to represent his blood. The Jewish leaders are blue because it contrasts well with red, and most of the chart involves conflict between Jesus and the Jewish leaders (with the pink crowd usually acting as a buffer to prevent Jesus&#8217; arrest). Pilate and Herod are imperial purple. Orange is similar in hue to Jesus’ red, so the disciples are orange. I tried not to get too heavy-handed with the symbolism, but there it is.</p>
<p>Most of the other colors are arbitrary (i.e., available in Illustrator’s default palette and of roughly the same saturation as the symbolic colors). John would be sharing a lot of space on the chart with Mary Magdalene and the other women, so I tried to give them colors (green, olive, yellow) that worked well together. The only future change from the color scheme in this version involves the guards, who change from cyan (too bright) to a light purple.</p>
<h3>Version 4: Less Technicolor</h3>
<p><img src="http://a.openbible.info/blog/2011-04-holy-week-muted.png" width="800" height="258" alt="Lighter lines open up the image considerably" /></p>
<p>This version reduced the line weight and introduced Peter, John, and Judas only when they needed to appear as independent entities in the story. It works better, but there are still two problems with it.</p>
<p>First, look at the giant areas of whitespace in the bottom left and top right (especially the top right). Using the vertical axis to indicate absolute travel through space is a nice idea, but I couldn’t figure out how to do it without creating these huge gaps. In the next version, I abandoned the vertical-axis-as-space idea—it now indicated travel between places, but you could no longer follow a horizontal line to see everything that happened in a single place.</p>
<p>Second, I realized that I wouldn’t be able to incorporate every event and person, as they added clutter. I could have added close-ups to illustrate these details—obviously there was enough space for them. However, I felt that including them would distract from the main point: to show Holy Week at a glance. I’m still a bit torn over omitting them, but I think it was a better decision to reduce the total space used by the chart.</p>
<p>I also abandoned the idea that Jesus went to the Temple on Wednesday. Some commentators think he did; others disagree. From a story-structure standpoint, I like the idea that Judas slipped away from the other disciples to bargain for his thirty pieces of silver while Jesus was teaching in the Temple. However, the text is ambiguous on when exactly Judas agreed to betray Jesus and what Jesus was doing on Wednesday.</p>
<h3>Version 5: Text</h3>
<p><img src="http://a.openbible.info/blog/2011-04-holy-week-final.png" width="799" height="223" alt="Final version with text" /></p>
<p>This is the final version. It condenses a lot of vertical and horizontal space; moves some lines around so they overlap less; and, most importantly, adds text: titles for major events; shading and place names for major locations; verse references; line labels; and a short explanation.</p>
<p>The xkcd chart is brilliant in that it doesn’t need a key: following recent trends in UI design, all the labels are inline. I definitely wanted to keep that approach, which meant making lots of labels and placing them on the lines. Again, my lack of experience with Illustrator showed up: I couldn’t get the text to center on the lines automatically, and I had trouble creating an outer glow on the text to provide some contrast with the background and make sure that the text was legible. (Black text on a bright blue background is an unpleasant combination.) But the glow always ate into the letters. Thus, I ended up creating lots of pixel-perfect, partially transparent rectangles as backgrounds for the labels. Some of the person lines had somehow slipped out of alignment with the grid, so I had to do a lot of clicking to get things back into order. In retrospect, it was good that I had to make the rectangles; I might not otherwise have noticed that the lines weren’t all where they were supposed to be.</p>
<p>The shaded boxes to indicate places are straight-up rounded rectangles (though I’m not sure why the corner radius is a global application preference in Illustrator). These boxes, borrowed from xkcd, replace the vertical-axis idea I earlier toyed with.</p>
<p>Finally, I added event titles and verse references. Here I tried to be comprehensive, including references even when I didn’t have a title to put with them. For example, there are two fig tree stories in the Gospels, but I only titled one of them. The references are available to you if you want to read both, though.</p>
<h3>Conclusion</h3>
<p>This project was fun, if time-consuming. In total, it took somewhere between forty and sixty hours (much of it spent climbing Illustrator&#8217;s learning curve). The chart ended up looking less like the xkcd comic and more like a transit map than I was expecting at the outset, but that&#8217;s OK. I’m now a whole lot more familiar with the Holy Week timeline, and I hope that others find the chart useful, too. If it helps improve Bible literacy even a little bit, then I consider it a success.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.openbible.info/blog/2011/04/holy-week-timeline-behind-the-music/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>What Twitterers Are Giving up for Lent (2011 Edition)</title>
		<link>http://www.openbible.info/blog/2011/03/what-twitterers-are-giving-up-for-lent-2011-edition/</link>
		<comments>http://www.openbible.info/blog/2011/03/what-twitterers-are-giving-up-for-lent-2011-edition/#comments</comments>
		<pubDate>Fri, 11 Mar 2011 01:38:10 +0000</pubDate>
		<dc:creator>openbible</dc:creator>
				<category><![CDATA[Twitter]]></category>
		<category><![CDATA[Visualizations]]></category>

		<guid isPermaLink="false">http://www.openbible.info/blog/?p=432</guid>
		<description><![CDATA[Congratulations, I guess, go this year to Charlie Sheen, who came in at both #23 and, with &#8220;tiger blood,&#8221; at #90. Justin Bieber is up several spots this year, so he hasn&#8217;t quite crested yet. The next-highest celebrity, who didn&#8217;t make the top 100, is British boy band One Direction. &#8220;Trophies,&#8221; at #69, refers to [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://a.openbible.info/blog/2011-03-lent-big.png"><img src="http://a.openbible.info/blog/2011-03-lent.png" width="800" height="422" alt="The top 100 things that people on Twitter are giving up for Lent in 2011." /></a></p>
<p>Congratulations, I guess, go this year to Charlie Sheen, who came in at both #23 and, with &#8220;tiger blood,&#8221; at #90. Justin Bieber is up several spots this year, so he hasn&#8217;t quite crested yet. The next-highest celebrity, who didn&#8217;t make the top 100, is British boy band <a href="http://twitter.com/onedirection">One Direction</a>.</p>
<p>&#8220;Trophies,&#8221; at #69, refers to the English soccer club <a href="http://en.wikipedia.org/wiki/Arsenal_F.C.">Arsenal</a>&#8216;s recent defeat, or something.</p>
<p>The later start to Lent this year means that &#8220;snow&#8221; doesn&#8217;t appear on the list&#8211;<a href="http://www.openbible.info/blog/2010/02/what-twitterers-are-giving-up-for-lent-2010-edition/">last year</a>, it was #48. Myspace hangs on at #99, dropping 48 places.</p>
<p>This list draws from 85,000 tweets from March 7-10, 2011, and excludes retweets.</p>
<table class="data">
<tr>
<th>Rank</th>
<th>Word</th>
<th>Count</th>
<th>Change from last year&#8217;s rank</th>
</tr>
<tr>
<td>1.</td>
<td>Twitter</td>
<td>4297</td>
<td>0</td>
</tr>
<tr>
<td>2.</td>
<td>Facebook</td>
<td>4060</td>
<td>0</td>
</tr>
<tr>
<td>3.</td>
<td>Chocolate</td>
<td>3185</td>
<td>0</td>
</tr>
<tr>
<td>4.</td>
<td>Swearing</td>
<td>2527</td>
<td>+1</td>
</tr>
<tr>
<td>5.</td>
<td>Alcohol</td>
<td>2347</td>
<td>-1</td>
</tr>
<tr>
<td>6.</td>
<td>Sex</td>
<td>2093</td>
<td>+3</td>
</tr>
<tr>
<td>7.</td>
<td>Soda</td>
<td>1959</td>
<td>-1</td>
</tr>
<tr>
<td>8.</td>
<td>Lent</td>
<td>1493</td>
<td>-1</td>
</tr>
<tr>
<td>9.</td>
<td>Meat</td>
<td>1352</td>
<td>-1</td>
</tr>
<tr>
<td>10.</td>
<td>Fast food</td>
<td>1303</td>
<td>0</td>
</tr>
<tr>
<td>11.</td>
<td>Sweets</td>
<td>1252</td>
<td>0</td>
</tr>
<tr>
<td>12.</td>
<td>Giving up things</td>
<td>778</td>
<td>+7</td>
</tr>
<tr>
<td>13.</td>
<td>School</td>
<td>768</td>
<td>+27</td>
</tr>
<tr>
<td>14.</td>
<td>Religion</td>
<td>745</td>
<td>+1</td>
</tr>
<tr>
<td>15.</td>
<td>Coffee</td>
<td>707</td>
<td>-3</td>
</tr>
<tr>
<td>16.</td>
<td>You</td>
<td>675</td>
<td>+6</td>
</tr>
<tr>
<td>17.</td>
<td>Social networking</td>
<td>665</td>
<td>+15</td>
</tr>
<tr>
<td>18.</td>
<td>Chips</td>
<td>664</td>
<td>+3</td>
</tr>
<tr>
<td>19.</td>
<td>Junk food</td>
<td>594</td>
<td>-1</td>
</tr>
<tr>
<td>20.</td>
<td>Bread</td>
<td>571</td>
<td>+6</td>
</tr>
<tr>
<td>21.</td>
<td>Smoking</td>
<td>555</td>
<td>-4</td>
</tr>
<tr>
<td>22.</td>
<td>Candy</td>
<td>541</td>
<td>-8</td>
</tr>
<tr>
<td>23.</td>
<td>Charlie Sheen</td>
<td>511</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>24.</td>
<td>Work</td>
<td>482</td>
<td>+4</td>
</tr>
<tr>
<td>25.</td>
<td>Stuff</td>
<td>467</td>
<td>-2</td>
</tr>
<tr>
<td>26.</td>
<td>Catholicism</td>
<td>436</td>
<td>-10</td>
</tr>
<tr>
<td>27.</td>
<td>Food</td>
<td>395</td>
<td>+3</td>
</tr>
<tr>
<td>28.</td>
<td>Shopping</td>
<td>363</td>
<td>+1</td>
</tr>
<tr>
<td>29.</td>
<td>Marijuana</td>
<td>358</td>
<td>+31</td>
</tr>
<tr>
<td>30.</td>
<td>Beer</td>
<td>346</td>
<td>-10</td>
</tr>
<tr>
<td>31.</td>
<td>Fried food</td>
<td>307</td>
<td>-7</td>
</tr>
<tr>
<td>32.</td>
<td>Homework</td>
<td>306</td>
<td>+27</td>
</tr>
<tr>
<td>33.</td>
<td>Cheese</td>
<td>297</td>
<td>+4</td>
</tr>
<tr>
<td>34.</td>
<td>Cookies</td>
<td>293</td>
<td>+11</td>
</tr>
<tr>
<td>35.</td>
<td>Red meat</td>
<td>285</td>
<td>-10</td>
</tr>
<tr>
<td>36.</td>
<td>Masturbation</td>
<td>285</td>
<td>+8</td>
</tr>
<tr>
<td>37.</td>
<td>Virginity</td>
<td>253</td>
<td>+26</td>
</tr>
<tr>
<td>38.</td>
<td>Pancakes</td>
<td>252</td>
<td>+20</td>
</tr>
<tr>
<td>39.</td>
<td>Rice</td>
<td>236</td>
<td>-5</td>
</tr>
<tr>
<td>40.</td>
<td>Booze</td>
<td>235</td>
<td>+2</td>
</tr>
<tr>
<td>41.</td>
<td>Coke</td>
<td>234</td>
<td>-3</td>
</tr>
<tr>
<td>42.</td>
<td>Boys</td>
<td>229</td>
<td>+24</td>
</tr>
<tr>
<td>43.</td>
<td>Sugar</td>
<td>229</td>
<td>-16</td>
</tr>
<tr>
<td>44.</td>
<td>Sobriety</td>
<td>226</td>
<td>+10</td>
</tr>
<tr>
<td>45.</td>
<td>Procrastination</td>
<td>226</td>
<td>-10</td>
</tr>
<tr>
<td>46.</td>
<td>Nothing</td>
<td>219</td>
<td>+21</td>
</tr>
<tr>
<td>47.</td>
<td>Winning</td>
<td>219</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>48.</td>
<td>Ice cream</td>
<td>211</td>
<td>-7</td>
</tr>
<tr>
<td>49.</td>
<td>Caffeine</td>
<td>203</td>
<td>-16</td>
</tr>
<tr>
<td>50.</td>
<td>McDonald&#8217;s</td>
<td>195</td>
<td>+27</td>
</tr>
<tr>
<td>51.</td>
<td>Church</td>
<td>188</td>
<td>+28</td>
</tr>
<tr>
<td>52.</td>
<td>Wine</td>
<td>188</td>
<td>-3</td>
</tr>
<tr>
<td>53.</td>
<td>TV</td>
<td>184</td>
<td>-7</td>
</tr>
<tr>
<td>54.</td>
<td>Starbucks</td>
<td>183</td>
<td>-15</td>
</tr>
<tr>
<td>55.</td>
<td>Texting</td>
<td>182</td>
<td>-12</td>
</tr>
<tr>
<td>56.</td>
<td>Liquor</td>
<td>181</td>
<td>-1</td>
</tr>
<tr>
<td>57.</td>
<td>Negativity</td>
<td>180</td>
<td>+26</td>
</tr>
<tr>
<td>58.</td>
<td>Carbs</td>
<td>179</td>
<td>+10</td>
</tr>
<tr>
<td>59.</td>
<td>Christianity</td>
<td>177</td>
<td>-12</td>
</tr>
<tr>
<td>60.</td>
<td>Justin Bieber</td>
<td>176</td>
<td>+9</td>
</tr>
<tr>
<td>61.</td>
<td>Pizza</td>
<td>175</td>
<td>-11</td>
</tr>
<tr>
<td>62.</td>
<td>French fries</td>
<td>159</td>
<td>+2</td>
</tr>
<tr>
<td>63.</td>
<td>Me</td>
<td>157</td>
<td>+9</td>
</tr>
<tr>
<td>64.</td>
<td>Losing</td>
<td>155</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>65.</td>
<td>Men</td>
<td>152</td>
<td>-13</td>
</tr>
<tr>
<td>66.</td>
<td>Fizzy drinks</td>
<td>151</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>67.</td>
<td>Porn</td>
<td>147</td>
<td>+4</td>
</tr>
<tr>
<td>68.</td>
<td>Lint</td>
<td>147</td>
<td>-11</td>
</tr>
<tr>
<td>69.</td>
<td>Trophies</td>
<td>144</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>70.</td>
<td>Tumblr</td>
<td>144</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>71.</td>
<td>Desserts</td>
<td>142</td>
<td>-15</td>
</tr>
<tr>
<td>72.</td>
<td>Chicken</td>
<td>140</td>
<td>+15</td>
</tr>
<tr>
<td>73.</td>
<td>Pork</td>
<td>139</td>
<td>-3</td>
</tr>
<tr>
<td>74.</td>
<td>Cake</td>
<td>132</td>
<td>+8</td>
</tr>
<tr>
<td>75.</td>
<td>Tea</td>
<td>127</td>
<td>+19</td>
</tr>
<tr>
<td>76.</td>
<td>Sarcasm</td>
<td>127</td>
<td>+14</td>
</tr>
<tr>
<td>77.</td>
<td>Diet Coke</td>
<td>119</td>
<td>-16</td>
</tr>
<tr>
<td>78.</td>
<td>Laziness</td>
<td>118</td>
<td>-13</td>
</tr>
<tr>
<td>79.</td>
<td>Sleep</td>
<td>117</td>
<td>-6</td>
</tr>
<tr>
<td>80.</td>
<td>Jesus</td>
<td>115</td>
<td>-4</td>
</tr>
<tr>
<td>81.</td>
<td>College</td>
<td>111</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>82.</td>
<td>Internet</td>
<td>110</td>
<td>-46</td>
</tr>
<tr>
<td>83.</td>
<td>Complaining</td>
<td>108</td>
<td>-9</td>
</tr>
<tr>
<td>84.</td>
<td>Breathing</td>
<td>103</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>85.</td>
<td>Takeout</td>
<td>98</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>86.</td>
<td>Beef</td>
<td>98</td>
<td>-8</td>
</tr>
<tr>
<td>87.</td>
<td>People</td>
<td>96</td>
<td>+11</td>
</tr>
<tr>
<td>88.</td>
<td>New Year&#8217;s resolutions</td>
<td>96</td>
<td>+1</td>
</tr>
<tr>
<td>89.</td>
<td>Him</td>
<td>94</td>
<td>-5</td>
</tr>
<tr>
<td>90.</td>
<td>Tiger blood</td>
<td>92</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>91.</td>
<td>Makeup</td>
<td>91</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>92.</td>
<td>Juice</td>
<td>90</td>
<td>-7</td>
</tr>
<tr>
<td>93.</td>
<td>Clothes</td>
<td>89</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>94.</td>
<td>My phone</td>
<td>88</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>95.</td>
<td>God</td>
<td>87</td>
<td>-15</td>
</tr>
<tr>
<td>96.</td>
<td>Abstinence</td>
<td>85</td>
<td>-15</td>
</tr>
<tr>
<td>97.</td>
<td>Stress</td>
<td>84</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>98.</td>
<td>Chipotle</td>
<td>82</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>99.</td>
<td>Myspace</td>
<td>81</td>
<td>-48</td>
</tr>
<tr>
<td>100.</td>
<td>Eating out</td>
<td>81</td>
<td>-25</td>
</tr>
</table>
<p>Image created using <a href="http://www.wordle.net/">Wordle</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.openbible.info/blog/2011/03/what-twitterers-are-giving-up-for-lent-2011-edition/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Quantifying Traditional vs. Contemporary Language in English Bibles Using Google NGram Data</title>
		<link>http://www.openbible.info/blog/2010/12/quantifying-traditional-vs-contemporary-language-in-english-bibles-using-google-ngram-data/</link>
		<comments>http://www.openbible.info/blog/2010/12/quantifying-traditional-vs-contemporary-language-in-english-bibles-using-google-ngram-data/#comments</comments>
		<pubDate>Mon, 27 Dec 2010 16:11:24 +0000</pubDate>
		<dc:creator>openbible</dc:creator>
				<category><![CDATA[Bible]]></category>
		<category><![CDATA[Linguistics]]></category>
		<category><![CDATA[Visualizations]]></category>

		<guid isPermaLink="false">http://www.openbible.info/blog/?p=395</guid>
		<description><![CDATA[Using data from Google&#8217;s new ngram corpus, here&#8217;s how English Bible translations compare in their use of traditional vs. contemporary vocabulary: * Partial Bible (New Testament except for The Voice, which only has the Gospel of John). The colors represent somewhat arbitrary groups. Here&#8217;s similar data with the most recent publication year (since 1970) as [...]]]></description>
			<content:encoded><![CDATA[<p>Using data from Google&#8217;s new ngram corpus, here&#8217;s how English Bible translations compare in their use of traditional vs. contemporary vocabulary:</p>
<p><img src="http://a.openbible.info/blog/2010-12-translations.png" width="800" height="600" alt="Relative Traditional vs. Contemporary Language in English Bible Translations" /><br />
* Partial Bible (New Testament except for The Voice, which only has the Gospel of John). The colors represent somewhat arbitrary groups.</p>
<p>Here&#8217;s similar data with the most recent publication year (since 1970) as the x-axis:</p>
<p><img src="http://a.openbible.info/blog/2010-12-translations-publication.png" width="800" height="660" alt="Relative Traditional vs. Contemporary Language in English Bible Translations by Publication Year" /></p>
<h3>Discussion</h3>
<p>The result accords well with my expectations of translations. It generally follows the &#8220;word for word/thought for thought&#8221; continuum often used to categorize translations, suggesting that word-for-word, functionally equivalent translations tend toward traditional language, while thought-for-thought, dynamic-equivalent translations sometimes find replacements for traditional words. For reference, here&#8217;s how Bible publisher Zondervan categorizes translations along that continuum:</p>
<p><a href="http://www.zondervan.com/Cultures/en-US/Product/Bible/Translations/About+Bible+Translations.htm?QueryStringSite=Zondervan"><img src="http://a.openbible.info/blog/2010-12-translations-continuum.png" width="800" height="186" alt="A word-for-word to thought-for-thought continuum lists about twenty English translations, from an interlinear to The Message." /></a></p>
<p>I&#8217;m not sure what to make of the curious NLT grouping in the first chart above: the five translations are more similar than any others. In particular, I&#8217;d expect the new <a href="http://www.commonenglishbible.com/">Common English Bible</a> to be more contemporary&#8211;perhaps it will become so once the Old Testament is available and it&#8217;s more comparable to other translations.</p>
<p>In the chart with publication years, notice how no one tries to occupy the same space as the NIV for twenty years until the HCSB comes along.</p>
<p>The World English Bible appears where it does largely because it uses &#8220;Yahweh&#8221; instead of &#8220;LORD.&#8221; If you ignore that word, the WEB shows up between the Amplified and the NASB. (The word <a href="http://ngrams.googlelabs.com/graph?content=Yahweh&#038;year_start=1800&#038;year_end=2008&#038;corpus=0&#038;smoothing=3">Yahweh</a> has become more popular recently.) Similarly, the New Jerusalem Bible would appear between the HCSB and the NET for the same reason.</p>
<p>The more contemporary versions often use contractions (e.g., <a href="http://ngrams.googlelabs.com/graph?content=you%27ll&#038;year_start=1800&#038;year_end=2008&#038;corpus=0&#038;smoothing=3">you&#8217;ll</a>), which pulls their score considerably toward the contemporary side.</p>
<p>Religious words (&#8220;God,&#8221; &#8220;Jesus&#8221;) pull translations to the traditional side, since a greater percentage of books in the past dealt with religious subjects. A religious text such as the Bible therefore naturally tends toward older language.</p>
<p>If you&#8217;re looking for translations largely free from copyright restrictions, most of the KJV-grouped translations are public domain. The <a href="http://www.lexhamenglishbible.com/">Lexham English Bible</a> and the <a href="http://www.ebible.org/web/">World English Bible</a> are available in the ESV/NASB group. The <a href="http://net.bible.org/">NET Bible</a> is available in the NIV group. Interestingly, all the more contemporary-style translations are under standard copyright; I don&#8217;t know of a project to produce an open thought-for-thought translation&#8211;maybe because there&#8217;s more room for disagreement in such a project?</p>
<p>Not included in the above chart is the <a href="http://www.lolcatbible.com/">LOLCat Bible</a>, a non-academic attempt to translate the Bible into <a href="http://en.wikipedia.org/wiki/Lolcat">LOLspeak</a>. If charted, it appears well to the contemporary side of The Message:</p>
<p><img src="http://a.openbible.info/blog/2010-12-translations-lol.png" width="800" height="41" alt="The KJV is on the far left, The Message is in the middle, and the LOLCat Bible is on the far right." /></p>
<h3>Methodology</h3>
<p>I downloaded the <a href="http://ngrams.googlelabs.com/datasets">English 1-gram corpus</a> from Google, normalized the words (stripping combining characters and making them case insensitive), and inserted the five million or so unique words into a database table. I combined individual years into decades to lower the row count. Next, I ran a percentage-wise comparison (similar to what <a href="http://ngrams.googlelabs.com/">Google&#8217;s ngram viewer</a> does) for each word to determine when they were most popular.</p>
<p>Then, I created word counts for a variety of translations, dropped stopwords, and multiplied the counts by the above ngram percentages to arrive at a median year for each translation.</p>
<p>The year scale (x-axis on the first chart, y-axis on the second) runs from 1838 to 1878, largely, as mentioned before, because Bibles use religious language. Even the LOLCat Bible dates to 1921 because it uses words (e.g., &#8220;ceiling cat&#8221;) that don&#8217;t particularly tie it to the present.</p>
<h3>Caveats</h3>
<p>The data doesn&#8217;t present a complete picture of a translation&#8217;s suitability for a particular audience or overall readability. For example, it doesn&#8217;t take into account word order (&#8220;fear not&#8221; vs. &#8220;do not fear&#8221;). (I wanted to use Google&#8217;s two- or three-gram data to see what differences they make, but as of this writing, Google hasn&#8217;t finished uploading them.)</p>
<p>I work for Zondervan, which publishes the NIV family of Bibles, but the work here is my own and I don&#8217;t speak for them.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.openbible.info/blog/2010/12/quantifying-traditional-vs-contemporary-language-in-english-bibles-using-google-ngram-data/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Venn Diagram of Google Bible Searches</title>
		<link>http://www.openbible.info/blog/2010/10/venn-diagram-of-google-bible-searches/</link>
		<comments>http://www.openbible.info/blog/2010/10/venn-diagram-of-google-bible-searches/#comments</comments>
		<pubDate>Tue, 26 Oct 2010 02:51:07 +0000</pubDate>
		<dc:creator>openbible</dc:creator>
				<category><![CDATA[Collective Intelligence]]></category>
		<category><![CDATA[Topics]]></category>
		<category><![CDATA[Visualizations]]></category>

		<guid isPermaLink="false">http://www.openbible.info/blog/?p=383</guid>
		<description><![CDATA[Technomancy.org just released a Google Suggest Venn Diagram Generator, where you enter a phrase and three ways to finish it: for example, &#8220;(Bible, New Testament, Old Testament) verses on&#8230;.&#8221; It then creates a Venn diagram showing you how Google autocompletes the phrase and where the suggestions overlap. The below diagram shows the result for &#8220;(Bible, [...]]]></description>
			<content:encoded><![CDATA[<p>Technomancy.org just released a <a href="http://www.technomancy.org/google-suggest-venn/">Google Suggest Venn Diagram Generator</a>, where you enter a phrase and three ways to finish it: for example, &#8220;(Bible, New Testament, Old Testament) verses on&#8230;.&#8221; It then creates a Venn diagram showing you how Google autocompletes the phrase and where the suggestions overlap.</p>
<p>The below diagram shows the result for &#8220;<a href="http://www.technomancy.org/google-suggest-venn/#start=X+verses+on&#038;end0=Bible&#038;end1=New+Testament&#038;end2=Old+Testament">(Bible, New Testament, Old Testament) verses on&#8230;</a>.&#8221; The overlapping words&#8211;faith, hope, love, forgiveness, prayer&#8211;present a decent (though incomplete) summary of Christianity.</p>
<p><a href="http://www.technomancy.org/google-suggest-venn/#start=X+verses+on&#038;end0=Bible&#038;end1=New+Testament&#038;end2=Old+Testament"><img src="http://a.openbible.info/blog/2010-10-venn.png" width="606" height="648" alt="A Venn diagram shows completions for (X Verses on...): Bible (courage, death, friendship, patience), New Testament (divorce, homosexuality, justice, tithing), Old Testament (Jesus), NT + Bible (hope, strength), OT + Bible (faith), OT + NT (marriage), and all three (forgiveness, love, prayer)." /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.openbible.info/blog/2010/10/venn-diagram-of-google-bible-searches/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Visualizing Pericope Similarity in the New Testament</title>
		<link>http://www.openbible.info/blog/2010/09/visualizing-pericope-similarity-in-the-new-testament/</link>
		<comments>http://www.openbible.info/blog/2010/09/visualizing-pericope-similarity-in-the-new-testament/#comments</comments>
		<pubDate>Tue, 14 Sep 2010 01:39:27 +0000</pubDate>
		<dc:creator>openbible</dc:creator>
				<category><![CDATA[Linguistics]]></category>
		<category><![CDATA[Visualizations]]></category>

		<guid isPermaLink="false">http://www.openbible.info/blog/?p=352</guid>
		<description><![CDATA[This diagram plots the similarity of pericopes (sections) in the New Testament based on their linguistic similarity in Greek: Blue = Gospels, Purple = Acts, Green = Paul’s Epistles, Red = General Epistles, Gray = Revelation If you don’t have Silverlight installed (or are reading this post via RSS&#8211;I suggest you click through to the [...]]]></description>
			<content:encoded><![CDATA[<p>This diagram plots the similarity of pericopes (sections) in the New Testament based on their linguistic similarity in Greek:</p>
<p><script src="http://zoom.it/zn9M.js?width=auto&#038;height=500px"></script></p>
<p>Blue = Gospels, Purple = Acts, Green = Paul’s Epistles, Red = General Epistles, Gray = Revelation</p>
<p>If you don’t have Silverlight installed (or are reading this post via RSS&#8211;I suggest you click through to the original post), here’s a thumbnail:</p>
<p><img src="http://a.openbible.info/blog/2010-09-pericopes-thumb.png" width="500" height="306" alt="Pericope similarity in the New Testament (thumbnail)." /></p>
<p>Download the <a href="http://a.openbible.info/blog/2010-09-pericopes.pdf">full-size PDF</a> (300KB) or <a href="http://a.openbible.info/blog/2010-09-pericopes.png">PNG</a> (22 MB, 12,000 pixels wide).</p>
<p>Do we actually learn anything from this kind of diagram? The most interesting part to me is how the gospels on the right flow primarily through the Gospel of John to the epistles on the left. I wonder why that is.</p>
<h3>Methodology</h3>
<p>I calculated the cosine similarity between the full text of the pericopes using the Greek lemmas (after removing about forty stopwords). The pericope titles come from the <a href="http://www.esv.org/">ESV</a>. I produced the diagram with <a href="http://www.cytoscape.org/">Cytoscape</a>. The widget at the top of the post comes from <a href="http://zoom.it">zoom.it</a>, Microsoft&#8217;s Deep-Zoom-as-a-Service.</p>
<p>Bill Mounce’s excellent <a href="http://www.greek-dictionary.net/">free New Testament Greek dictionary</a> served as the source of the lemmas.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.openbible.info/blog/2010/09/visualizing-pericope-similarity-in-the-new-testament/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Bible Cross References Visualization</title>
		<link>http://www.openbible.info/blog/2010/04/bible-cross-references-visualization/</link>
		<comments>http://www.openbible.info/blog/2010/04/bible-cross-references-visualization/#comments</comments>
		<pubDate>Fri, 16 Apr 2010 11:55:09 +0000</pubDate>
		<dc:creator>openbible</dc:creator>
				<category><![CDATA[Cross References]]></category>
		<category><![CDATA[Visualizations]]></category>

		<guid isPermaLink="false">http://www.openbible.info/blog/?p=214</guid>
		<description><![CDATA[Here&#8217;s a visualization of 340,000 Bible cross references: Larger version (2,000 x 1,600 pixels). Does anything strike you as intriguing? A few trends jump out at me: The frequency of dense New Testament streaks in the Old Testament, especially in Leviticus and Deuteronomy; I didn&#8217;t expect to see them there. The loops in Samuel / [...]]]></description>
			<content:encoded><![CDATA[<p>Here&#8217;s a visualization of <a href="http://www.openbible.info/labs/cross-references/">340,000 Bible cross references</a>:</p>
<p><a href="http://a.openbible.info/blog/2010-04-cross-references-2000.png"><img src="http://a.openbible.info/blog/2010-04-cross-references-800.jpg" width="800" height="640" alt="Visualization of Bible cross references." /></a><br />
<a href="http://a.openbible.info/blog/2010-04-cross-references-2000.png">Larger version</a> (2,000 x 1,600 pixels).</p>
<p>Does anything strike you as intriguing? A few trends jump out at me:</p>
<ol>
<li>The frequency of dense New Testament streaks in the Old Testament, especially in Leviticus and Deuteronomy; I didn&#8217;t expect to see them there.</li>
<li>The loops in Samuel / Kings / Chronicles and in the Gospels indicating parallel stories.</li>
<li>The sudden increased density of New Testament references in Psalms through Isaiah.</li>
<li>The eschatological references in Isaiah and Daniel.</li>
<li>The density of references from the Minor Prophets back to both the Major Prophets and earlier in the Old Testament.</li>
<li>The surprising density of cross references in Hebrew-Jude.</li>
<li>The asymmetry. If verse A cites verse B, verse B doesn&#8217;t necessarily cite verse A. I wonder if I should make the data symmetrical.</li>
</ol>
<p>You can also download the <a href="http://a.openbible.info/blog/2010-04-cross-references-10000.png">full-size image</a> (10,000 x 8,000 pixels, 75 MB PNG). It&#8217;s a very large image that could crash your browser. If you want it, I strongly recommend that you save it to your computer rather than trying to open it in your browser.</p>
<p>This visualization uses data from the <a href="http://www.openbible.info/labs/cross-references/">Bible Cross References</a> project. I used PHP&#8217;s GD library to create the graphic.</p>
<p>Inspired by <a href="http://www.chrisharrison.net/projects/bibleviz/">Chris Harrison</a> and Christian Swinehart&#8217;s wonderful <a href="http://samizdat.cc/cyoa/">Choose Your Own Adventure</a> work.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.openbible.info/blog/2010/04/bible-cross-references-visualization/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Presentation on Tweeting the Bible</title>
		<link>http://www.openbible.info/blog/2010/03/presentation-on-tweeting-the-bible/</link>
		<comments>http://www.openbible.info/blog/2010/03/presentation-on-tweeting-the-bible/#comments</comments>
		<pubDate>Sat, 27 Mar 2010 00:53:49 +0000</pubDate>
		<dc:creator>openbible</dc:creator>
				<category><![CDATA[Bible]]></category>
		<category><![CDATA[Twitter]]></category>
		<category><![CDATA[Visualizations]]></category>

		<guid isPermaLink="false">http://www.openbible.info/blog/?p=195</guid>
		<description><![CDATA[Here&#8217;s a presentation I just gave at the BibleTech 2010 conference about how people tweet the Bible: Bible Tech 2010 Tweeting the Bible View more presentations from openbibleinfo. Also: PowerPoint, PDF. I distributed the following handout at the presentation, showing the popularity of Bible chapters and verses cited on Twitter. It displays a lot of [...]]]></description>
			<content:encoded><![CDATA[<p>Here&#8217;s a presentation I just gave at the <a href="http://www.bibletechconference.com/">BibleTech</a> 2010 conference about how people <a href="http://www.openbible.info/realtime/">tweet the Bible</a>:</p>
<div style="width:425px" id="__ss_3568513"><strong style="display:block;margin:12px 0 4px"><a href="http://www.slideshare.net/openbibleinfo/bible-tech-2010-tweeting-the-bible" title="Bible Tech 2010 Tweeting the Bible">Bible Tech 2010 Tweeting the Bible</a></strong><object width="425" height="355"><param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=bibletech2010-stephensmith-tweetingthebible-100326192548-phpapp02&#038;rel=0&#038;stripped_title=bible-tech-2010-tweeting-the-bible" /><param name="allowFullScreen" value="true"/><param name="allowScriptAccess" value="always"/><embed src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=bibletech2010-stephensmith-tweetingthebible-100326192548-phpapp02&#038;rel=0&#038;stripped_title=bible-tech-2010-tweeting-the-bible" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"></embed></object>
<div style="padding:5px 0 12px">View more <a href="http://www.slideshare.net/">presentations</a> from <a href="http://www.slideshare.net/openbibleinfo">openbibleinfo</a>.</div>
</div>
<p>Also: <a href="http://a.openbible.info/blog/2010-03-bibletech.pptx">PowerPoint</a>, <a href="http://a.openbible.info/blog/2010-03-bibletech.pdf">PDF</a>.</p>
<p>I distributed the following handout at the presentation, showing the popularity of Bible chapters and verses cited on Twitter. It displays a lot of data: darker chapters are more popular, the number in the middle of each box is the most popular verse in the chapter, and sparklines in each box show the distribution of the popularity in each chapter. (Genesis 1:1 is by far the most popular verse in Genesis 1, while Genesis 3:15 is only a little more popular than other verses in the chapter.)</p>
<p><a href="http://a.openbible.info/blog/2010-03-bibletech-big.png"><img src="http://a.openbible.info/blog/2010-03-bibletech-small.png" width="500" height="371" alt="The grid shows the popularity of chapters and verses in the Bible as cited on Twitter." /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.openbible.info/blog/2010/03/presentation-on-tweeting-the-bible/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
	</channel>
</rss>

