<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>OpenBible.info Blog &#187; Twitter</title>
	<atom:link href="http://www.openbible.info/blog/category/twitter/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.openbible.info/blog</link>
	<description></description>
	<lastBuildDate>Wed, 08 Feb 2012 01:35:46 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>Bible Annotation Modeling and Querying in MySQL and CouchDB</title>
		<link>http://www.openbible.info/blog/2011/09/bible-annotation-modeling-and-querying-in-mysql-and-couchdb/</link>
		<comments>http://www.openbible.info/blog/2011/09/bible-annotation-modeling-and-querying-in-mysql-and-couchdb/#comments</comments>
		<pubDate>Thu, 01 Sep 2011 12:03:55 +0000</pubDate>
		<dc:creator>openbible</dc:creator>
				<category><![CDATA[Data Modeling]]></category>
		<category><![CDATA[Twitter]]></category>
		<category><![CDATA[Visualizations]]></category>

		<guid isPermaLink="false">http://www.openbible.info/blog/?p=464</guid>
		<description><![CDATA[If you’re storing people’s Bible annotations (notes, bookmarks, highlights, etc.) digitally, you want to be able to retrieve them later. Let’s look at some strategies for how to store and look up these annotations. Know What You’re Modeling First you need to understand the shape of the data. I don’t have access to a large [...]]]></description>
			<content:encoded><![CDATA[<p>If you’re storing people’s Bible annotations (notes, bookmarks, highlights, etc.) digitally, you want to be able to retrieve them later. Let’s look at some strategies for how to store and look up these annotations.</p>
<h3>Know What You’re Modeling</h3>
<p>First you need to understand the shape of the data. I don’t have access to a large repository of Bible annotations, but the Twitter and Facebook Bible citations from the <a href="http://www.openbible.info/realtime/">Realtime Bible Search</a> section of this website provide a good approximation of how people cite the Bible. (Quite a few Facebook posts appear to involve people responding to their daily devotions.) These tweets and posts are public, and private annotations may take on a slightly different form, but the general shape of the data should be similar: nearly all (99%) refer to a chapter or less.</p>
<p><a href="http://a.openbible.info/blog/2011-08-social-full.png"><img src="http://a.openbible.info/blog/2011-08-social.png" width="600" height="439" alt="Large dots at the bottom indicate many single-verse references. Chapter references are also fairly prominent. See below for more discussion." /></a></p>
<p><a href="http://www.biblegateway.com/blog/2011/08/how-people-share-the-bible-verses-vs-read-the-bible-chapters/">Compare Bible Gateway reading habits</a>, which are much heavier on chapter-level usage, but 98% of accesses still involve a chapter or less.</p>
<h3>The Numbers</h3>
<p>The data consists of about 35 million total references.</p>
<table class="data">
<tr>
<th class="number">Percent of Total</th>
<th>Description</th>
<th>Example</th>
</tr>
<tr>
<td class="number">73.5</td>
<td>Single verse</td>
<td>John 3:16</td>
</tr>
<tr>
<td class="number">17.1</td>
<td>Verse range in a single chapter</td>
<td>John 3:16-17</td>
</tr>
<tr>
<td class="number">8.4</td>
<td>Exactly one chapter</td>
<td>John 3</td>
</tr>
<tr>
<td class="number">0.7</td>
<td>Two or more chapters (at chapter boundaries)</td>
<td>John 3-4</td>
</tr>
<tr>
<td class="number">0.1</td>
<td>Verses spanning two chapters (not at chapter boundaries)</td>
<td>John 3:16-4:2</td>
</tr>
<tr>
<td class="number">0.1</td>
<td>Verses spanning three or more chapters (not at chapter boundaries)</td>
<td>John 3:16-5:2</td>
</tr>
</table>
<p>About 92.9% of posts or tweets cited only one verse or verse range; 7.1% mentioned more than one verse range. Of the latter, 77% cited exactly two verse ranges; the highest had 323 independent verse ranges. Of Facebook posts, 9.1% contained multiple verse ranges, compared to 4.2% of tweets. When there were multiple ranges, 43% of the time they referred to verses in different books from the other ranges; 39% referred to verses in the same book (but not in the same chapter); and 18% referred to verses in the same chapter. (This distribution is a unusual—normally close verses stick together.)</p>
<p>The data, oddly, doesn’t contain any references that span multiple books. Less than 0.01% of passage accesses span multiple books on Bible Gateway, which is probably a useful upper bound for this type of data.</p>
<h4>Key Points</h4>
<ol>
<li>Nearly all citations involve verses in the same chapter; only 1% involve verses in multiple chapters.</li>
<li>Of the 1% spanning two or more chapters, most refer to exact chapter boundaries.</li>
<li>Multiple-book references are even more unusual (under 0.01%) but have outsize effects: an annotation that references Genesis 1 to Revelation 22 would be relevant for every verse in the Bible.</li>
<li>Around 7% of notes contained multiple independent ranges of verses—the more text you allow for an annotation, the more likely someone is to mention multiple verses.</li>
</ol>
<h4>Download</h4>
<p><a href="http://a.openbible.info/blog/2011-08-social-lengths.zip">Download the raw social data</a> (1.4 MB zip) under the usual CC-Attribution license.</p>
<h3>Data Modeling</h3>
<p>A Bible annotation consists of arbitrary content (a highlight might have one kind of content, while a proper note might have a title, body, attachments, etc., but modeling the content itself isn&#8217;t the point of this piece) tied to one or more Bible references:</p>
<ol>
<li>A single verse (John 3:16).</li>
<li>A single range (John 3:16-17).</li>
<li>Multiple verses or ranges (John 3:16, John 3:18-19)</li>
</ol>
<h3>The Relational Model</h3>
<p>One user can have many rows of annotations, and one annotation can have many rows of verses that it refers to. To model a Bible annotation relationally, we set up three tables that look something like this:</p>
<h4>users</h4>
<table class="data">
<tr>
<th>user_id</th>
<th>name</th>
</tr>
<tr>
<td>1</td>
<td>…</td>
</tr>
</table>
<h4>annotations</h4>
<table class="data">
<tr>
<th>user_id</th>
<th>annotation_id</th>
<th>content</th>
</tr>
<tr>
<td>1</td>
<td>101</td>
<td>…</td>
</tr>
<tr>
<td>1</td>
<td>102</td>
<td>…</td>
</tr>
<tr>
<td>1</td>
<td>103</td>
<td>…</td>
</tr>
</table>
<h4>annotation_verses</h4>
<p> The verse references here are integers to allow for easy range searches: 43 = John (the 43rd book in the typical Protestant Bible); 003 = the third chapter; the last three digits = the verse number.</p>
<p>I like using this approach over others (sequential integer or separate columns for book, chapter, and verse) because it limits the need for a lookup table. (You just need to know that 43 = John, and then you can find any verse or range of verses in that book.) It also lets you find all the annotations for a particular chapter without having to know how many verses are in the chapter. (The longest chapter in the Bible has 176 verses, so you know that all the verses in John 3, for example, fall between 43003001 and 43003176.) This main disadvantage is that you don’t necessarily know how many verses you’re selecting until after you’ve selected them. And using individual columns, unlike here, does allow you to run <code>group by</code> queries to get easy counts.</p>
<table class="data">
<tr>
<th>annotation_id</th>
<th>start_verse</th>
<th>end_verse</th>
</tr>
<tr>
<td>101</td>
<td>43003016</td>
<td>43003016</td>
</tr>
<tr>
<td>102</td>
<td>43003016</td>
<td>43003017</td>
</tr>
<tr>
<td>103</td>
<td>43003016</td>
<td>43003016</td>
</tr>
<tr>
<td>103</td>
<td>43003019</td>
<td>43003020</td>
</tr>
</table>
<h3>Querying</h3>
<p>In a Bible application, the usual mode of accessing annotations is by passage: if you’re looking at John 3:16-18, you want to see all your annotations that apply to that passage.</p>
<h3>Querying MySQL</h3>
<p>In SQL terms:</p>
<p><code>select distinct(annotations.annotation_id)<br />
from annotations, annotation_verses<br />
where annotation_verses.start_verse &lt;= 43003018 and<br />
annotation_verses.end_verse &gt;= 43003016 and<br />
annotations.user_id = 1 and<br />
annotations.annotation_id = annotation_verses.annotation_id<br />
order by annotation_verses.start_verse asc, annotation_verses.end_verse desc</code></p>
<p>The quirkiest part of the SQL is the first part of the “where” clause, which at first glance looks backward: why is the last verse in the <code>start_verse</code> field and the first verse in the <code>end_verse</code> field? Because the <code>start_verse</code> and <code>end_verse</code> can span any range of verses, you need to make sure that you get any range that overlaps the verses you’re looking for: in other words, the <code>start_verse</code> is before the end of the range, and the <code>end_verse</code> is after the start.</p>
<p>Visually, you can think of each <code>start_verse</code> and <code>end_verse</code> pair as a line: if the line overlaps the shaded area you’re looking for, then it’s a relevant annotation. If not, it’s not relevant. There are six cases:</p>
<p><img src="http://a.openbible.info/blog/2011-08-before-after.png" width="516" height="277" alt="Start before, end before: John 3:15 / Start before, end inside: John 3:15-17 / Start before, end after: John 3:15-19 / Start inside, end inside: John 3:16-18 / Start inside, end after: John 3:17-19 / Start after, end after: John 3:19" /></p>
<p>The other trick in the SQL is the sort order: you generally want to see annotations in canonical order, starting with the longest range first. In other words, you start with an annotation about John 3, then to a section inside John 3, then to individual verses. In this way, you move from the broadest annotations to the narrowest annotations. You may want to switch up this order, but it makes a good default.</p>
<p>The relational approach works pretty well. If you worry about the performance implications of the SQL join, you can always put the <code>user_id</code> in <code>annotation_verses</code> or use a view or something.</p>
<h3>Querying CouchDB</h3>
<p><a href="http://couchdb.apache.org/">CouchDB</a> is one of the oldest entrants in the NoSQL space and distinguishes itself by being both a key-value store and queryable using map-reduce:  the usual way to access more than one document in a single query is to write Javascript to output the data you want. It lets you create complex keys to query by, so you might think that you can generate a key like <code>[start_verse,end_verse]</code> and query it like this: <code>?startkey=[0,43003016]&amp;endkey=[43003018,99999999]</code></p>
<p>But no. Views are one-dimensional, meaning that CouchDB doesn’t even look at the second element in the key if the first one matches the query. For example, an annotation with both a start and end verse of <code>19001001</code> matches the above query, which isn’t useful for this purpose.</p>
<p>I can think of two ways to get around this limitation, both of which have drawbacks.</p>
<h4>GeoCouch</h4>
<p>CouchDB has a plugin called GeoCouch that lets you query geographic data, which actually maps well to this data model. (I didn’t come up with this approach on my own: see <a href="http://www.diretto.org/2010/08/efficient-time-based-range-queries-in-couchdb-using-geocouch/">Efficient Time-based Range Queries in CouchDB using GeoCouch</a> for the background.)</p>
<p>The basic idea is to treat each <code>start_verse,end_verse</code> pair as a point on a two-dimensional grid. Here’s the above social data plotted this way:</p>
<p><img src="http://a.openbible.info/blog/2011-08-social-grid.png" width="600" height="551" alt="A diagonal line starts in the bottom left corner and continues to the top right. Large dots indicate popular verses, and book outlines are visible." /></p>
<p>The line bisects the grid diagonally since an <code>end_verse</code> never precedes a <code>start_verse</code>: the diagonal line where <code>start_verse = end_verse</code> indicates the lower bound of any reference. Here are some points indicating where ranges fall on the plot:</p>
<p><img src="http://a.openbible.info/blog/2011-08-social-grid-points.png" width="600" height="554" alt="This chart looks the same as the previous one but has points marked to illustrate that longer ranges are farther away from the bisecting line." /></p>
<p>To find all the annotations relevant to John 3:16-18, we draw a region starting in the upper left and continuing to the point <code>43003018,43003016</code>:</p>
<p><img src="http://a.openbible.info/blog/2011-08-social-grid-bbox.png" width="600" height="551" alt="This chart looks the same as the previous one but has a box from the top left ending just above and past the beginning of John near the upper right of the chart." /></p>
<p>GeoCouch allows exactly this kind of bounding-box query: <code>?bbox=0,43003016,43003018,99999999</code></p>
<p>You can even support multiple users in this scheme: just give everyone their own, independent box. I might occupy 1&#215;1 (with an annotation at <code>1.43003016,1.43003016</code>), while you might occupy 2&#215;2 (with an annotation at <code>2.43003016,2.43003016</code>); queries for our annotations would never overlap. Each whole number to the left of the decimal acts as a namespace.</p>
<p>The drawbacks:</p>
<ol>
<li>The results aren’t sorted in a useful way. You’ll need to do sorting on the client side or in a <a href="http://guide.couchdb.org/editions/1/en/show.html">show function</a>.</li>
<li>You don’t get pagination.</li>
</ol>
<h4>Repetition at Intervals</h4>
<p>Given the shape of the data, which is overwhelmingly chapter-bound (and lookups, which at least on Bible Gateway are chapter-based), you could simply repeat chapter-spanning annotations at the beginning of every chapter. In the worst case annotation (Genesis 1-Revelation 22), you end up with about 1200 repetitions.</p>
<p>For example, in the Genesis-Revelation case, for John 3 you might create a key like <code>[43000000.01001001,66022021]</code> so that it sorts at the beginning of the chapter—and if you have multiple annotations with different start verses, they stay sorted properly.</p>
<p>To get annotations for John 3:16-18, you’d query for <code>?startkey=[43003000]&amp;endkey=[43003018,{}]</code></p>
<p>The drawbacks:</p>
<ol>
<li>You have to filter out all the irrelevant annotations: if you have a lot of annotations about John 3:14, you have to skip through them all before you get to the ones about John 3:16.</li>
<li>You have to filter out duplicates when the range you’re querying for spans multiple chapters.</li>
<li>You’re repeating yourself, though given how rarely a multi-chapter span (let alone a multi-book span) happens in the wild, it might not matter that much.</li>
</ol>
<h4>Other CouchDB Approaches</h4>
<p>Both these approaches assume that you want to make only one query to retrieve the data. If you’re willing to make multiple queries, you could create different list functions and query them in parallel: for example, you could have one for single-chapter annotations and one for multi-chapter annotations. See <a href="http://en.wikipedia.org/wiki/Interval_tree">interval trees</a> and <a href="http://en.wikipedia.org/wiki/Geohash">geohashes</a> for additional ideas. You could also introduce a separate query layer, such as <a href="http://www.elasticsearch.org/">elasticsearch</a>, to sit <a href="http://www.elasticsearch.org/guide/reference/river/couchdb.html">on top of CouchDB</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.openbible.info/blog/2011/09/bible-annotation-modeling-and-querying-in-mysql-and-couchdb/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>What Twitterers Are Giving up for Lent (2011 Edition)</title>
		<link>http://www.openbible.info/blog/2011/03/what-twitterers-are-giving-up-for-lent-2011-edition/</link>
		<comments>http://www.openbible.info/blog/2011/03/what-twitterers-are-giving-up-for-lent-2011-edition/#comments</comments>
		<pubDate>Fri, 11 Mar 2011 01:38:10 +0000</pubDate>
		<dc:creator>openbible</dc:creator>
				<category><![CDATA[Twitter]]></category>
		<category><![CDATA[Visualizations]]></category>

		<guid isPermaLink="false">http://www.openbible.info/blog/?p=432</guid>
		<description><![CDATA[Congratulations, I guess, go this year to Charlie Sheen, who came in at both #23 and, with &#8220;tiger blood,&#8221; at #90. Justin Bieber is up several spots this year, so he hasn&#8217;t quite crested yet. The next-highest celebrity, who didn&#8217;t make the top 100, is British boy band One Direction. &#8220;Trophies,&#8221; at #69, refers to [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://a.openbible.info/blog/2011-03-lent-big.png"><img src="http://a.openbible.info/blog/2011-03-lent.png" width="800" height="422" alt="The top 100 things that people on Twitter are giving up for Lent in 2011." /></a></p>
<p>Congratulations, I guess, go this year to Charlie Sheen, who came in at both #23 and, with &#8220;tiger blood,&#8221; at #90. Justin Bieber is up several spots this year, so he hasn&#8217;t quite crested yet. The next-highest celebrity, who didn&#8217;t make the top 100, is British boy band <a href="http://twitter.com/onedirection">One Direction</a>.</p>
<p>&#8220;Trophies,&#8221; at #69, refers to the English soccer club <a href="http://en.wikipedia.org/wiki/Arsenal_F.C.">Arsenal</a>&#8216;s recent defeat, or something.</p>
<p>The later start to Lent this year means that &#8220;snow&#8221; doesn&#8217;t appear on the list&#8211;<a href="http://www.openbible.info/blog/2010/02/what-twitterers-are-giving-up-for-lent-2010-edition/">last year</a>, it was #48. Myspace hangs on at #99, dropping 48 places.</p>
<p>This list draws from 85,000 tweets from March 7-10, 2011, and excludes retweets.</p>
<table class="data">
<tr>
<th>Rank</th>
<th>Word</th>
<th>Count</th>
<th>Change from last year&#8217;s rank</th>
</tr>
<tr>
<td>1.</td>
<td>Twitter</td>
<td>4297</td>
<td>0</td>
</tr>
<tr>
<td>2.</td>
<td>Facebook</td>
<td>4060</td>
<td>0</td>
</tr>
<tr>
<td>3.</td>
<td>Chocolate</td>
<td>3185</td>
<td>0</td>
</tr>
<tr>
<td>4.</td>
<td>Swearing</td>
<td>2527</td>
<td>+1</td>
</tr>
<tr>
<td>5.</td>
<td>Alcohol</td>
<td>2347</td>
<td>-1</td>
</tr>
<tr>
<td>6.</td>
<td>Sex</td>
<td>2093</td>
<td>+3</td>
</tr>
<tr>
<td>7.</td>
<td>Soda</td>
<td>1959</td>
<td>-1</td>
</tr>
<tr>
<td>8.</td>
<td>Lent</td>
<td>1493</td>
<td>-1</td>
</tr>
<tr>
<td>9.</td>
<td>Meat</td>
<td>1352</td>
<td>-1</td>
</tr>
<tr>
<td>10.</td>
<td>Fast food</td>
<td>1303</td>
<td>0</td>
</tr>
<tr>
<td>11.</td>
<td>Sweets</td>
<td>1252</td>
<td>0</td>
</tr>
<tr>
<td>12.</td>
<td>Giving up things</td>
<td>778</td>
<td>+7</td>
</tr>
<tr>
<td>13.</td>
<td>School</td>
<td>768</td>
<td>+27</td>
</tr>
<tr>
<td>14.</td>
<td>Religion</td>
<td>745</td>
<td>+1</td>
</tr>
<tr>
<td>15.</td>
<td>Coffee</td>
<td>707</td>
<td>-3</td>
</tr>
<tr>
<td>16.</td>
<td>You</td>
<td>675</td>
<td>+6</td>
</tr>
<tr>
<td>17.</td>
<td>Social networking</td>
<td>665</td>
<td>+15</td>
</tr>
<tr>
<td>18.</td>
<td>Chips</td>
<td>664</td>
<td>+3</td>
</tr>
<tr>
<td>19.</td>
<td>Junk food</td>
<td>594</td>
<td>-1</td>
</tr>
<tr>
<td>20.</td>
<td>Bread</td>
<td>571</td>
<td>+6</td>
</tr>
<tr>
<td>21.</td>
<td>Smoking</td>
<td>555</td>
<td>-4</td>
</tr>
<tr>
<td>22.</td>
<td>Candy</td>
<td>541</td>
<td>-8</td>
</tr>
<tr>
<td>23.</td>
<td>Charlie Sheen</td>
<td>511</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>24.</td>
<td>Work</td>
<td>482</td>
<td>+4</td>
</tr>
<tr>
<td>25.</td>
<td>Stuff</td>
<td>467</td>
<td>-2</td>
</tr>
<tr>
<td>26.</td>
<td>Catholicism</td>
<td>436</td>
<td>-10</td>
</tr>
<tr>
<td>27.</td>
<td>Food</td>
<td>395</td>
<td>+3</td>
</tr>
<tr>
<td>28.</td>
<td>Shopping</td>
<td>363</td>
<td>+1</td>
</tr>
<tr>
<td>29.</td>
<td>Marijuana</td>
<td>358</td>
<td>+31</td>
</tr>
<tr>
<td>30.</td>
<td>Beer</td>
<td>346</td>
<td>-10</td>
</tr>
<tr>
<td>31.</td>
<td>Fried food</td>
<td>307</td>
<td>-7</td>
</tr>
<tr>
<td>32.</td>
<td>Homework</td>
<td>306</td>
<td>+27</td>
</tr>
<tr>
<td>33.</td>
<td>Cheese</td>
<td>297</td>
<td>+4</td>
</tr>
<tr>
<td>34.</td>
<td>Cookies</td>
<td>293</td>
<td>+11</td>
</tr>
<tr>
<td>35.</td>
<td>Red meat</td>
<td>285</td>
<td>-10</td>
</tr>
<tr>
<td>36.</td>
<td>Masturbation</td>
<td>285</td>
<td>+8</td>
</tr>
<tr>
<td>37.</td>
<td>Virginity</td>
<td>253</td>
<td>+26</td>
</tr>
<tr>
<td>38.</td>
<td>Pancakes</td>
<td>252</td>
<td>+20</td>
</tr>
<tr>
<td>39.</td>
<td>Rice</td>
<td>236</td>
<td>-5</td>
</tr>
<tr>
<td>40.</td>
<td>Booze</td>
<td>235</td>
<td>+2</td>
</tr>
<tr>
<td>41.</td>
<td>Coke</td>
<td>234</td>
<td>-3</td>
</tr>
<tr>
<td>42.</td>
<td>Boys</td>
<td>229</td>
<td>+24</td>
</tr>
<tr>
<td>43.</td>
<td>Sugar</td>
<td>229</td>
<td>-16</td>
</tr>
<tr>
<td>44.</td>
<td>Sobriety</td>
<td>226</td>
<td>+10</td>
</tr>
<tr>
<td>45.</td>
<td>Procrastination</td>
<td>226</td>
<td>-10</td>
</tr>
<tr>
<td>46.</td>
<td>Nothing</td>
<td>219</td>
<td>+21</td>
</tr>
<tr>
<td>47.</td>
<td>Winning</td>
<td>219</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>48.</td>
<td>Ice cream</td>
<td>211</td>
<td>-7</td>
</tr>
<tr>
<td>49.</td>
<td>Caffeine</td>
<td>203</td>
<td>-16</td>
</tr>
<tr>
<td>50.</td>
<td>McDonald&#8217;s</td>
<td>195</td>
<td>+27</td>
</tr>
<tr>
<td>51.</td>
<td>Church</td>
<td>188</td>
<td>+28</td>
</tr>
<tr>
<td>52.</td>
<td>Wine</td>
<td>188</td>
<td>-3</td>
</tr>
<tr>
<td>53.</td>
<td>TV</td>
<td>184</td>
<td>-7</td>
</tr>
<tr>
<td>54.</td>
<td>Starbucks</td>
<td>183</td>
<td>-15</td>
</tr>
<tr>
<td>55.</td>
<td>Texting</td>
<td>182</td>
<td>-12</td>
</tr>
<tr>
<td>56.</td>
<td>Liquor</td>
<td>181</td>
<td>-1</td>
</tr>
<tr>
<td>57.</td>
<td>Negativity</td>
<td>180</td>
<td>+26</td>
</tr>
<tr>
<td>58.</td>
<td>Carbs</td>
<td>179</td>
<td>+10</td>
</tr>
<tr>
<td>59.</td>
<td>Christianity</td>
<td>177</td>
<td>-12</td>
</tr>
<tr>
<td>60.</td>
<td>Justin Bieber</td>
<td>176</td>
<td>+9</td>
</tr>
<tr>
<td>61.</td>
<td>Pizza</td>
<td>175</td>
<td>-11</td>
</tr>
<tr>
<td>62.</td>
<td>French fries</td>
<td>159</td>
<td>+2</td>
</tr>
<tr>
<td>63.</td>
<td>Me</td>
<td>157</td>
<td>+9</td>
</tr>
<tr>
<td>64.</td>
<td>Losing</td>
<td>155</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>65.</td>
<td>Men</td>
<td>152</td>
<td>-13</td>
</tr>
<tr>
<td>66.</td>
<td>Fizzy drinks</td>
<td>151</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>67.</td>
<td>Porn</td>
<td>147</td>
<td>+4</td>
</tr>
<tr>
<td>68.</td>
<td>Lint</td>
<td>147</td>
<td>-11</td>
</tr>
<tr>
<td>69.</td>
<td>Trophies</td>
<td>144</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>70.</td>
<td>Tumblr</td>
<td>144</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>71.</td>
<td>Desserts</td>
<td>142</td>
<td>-15</td>
</tr>
<tr>
<td>72.</td>
<td>Chicken</td>
<td>140</td>
<td>+15</td>
</tr>
<tr>
<td>73.</td>
<td>Pork</td>
<td>139</td>
<td>-3</td>
</tr>
<tr>
<td>74.</td>
<td>Cake</td>
<td>132</td>
<td>+8</td>
</tr>
<tr>
<td>75.</td>
<td>Tea</td>
<td>127</td>
<td>+19</td>
</tr>
<tr>
<td>76.</td>
<td>Sarcasm</td>
<td>127</td>
<td>+14</td>
</tr>
<tr>
<td>77.</td>
<td>Diet Coke</td>
<td>119</td>
<td>-16</td>
</tr>
<tr>
<td>78.</td>
<td>Laziness</td>
<td>118</td>
<td>-13</td>
</tr>
<tr>
<td>79.</td>
<td>Sleep</td>
<td>117</td>
<td>-6</td>
</tr>
<tr>
<td>80.</td>
<td>Jesus</td>
<td>115</td>
<td>-4</td>
</tr>
<tr>
<td>81.</td>
<td>College</td>
<td>111</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>82.</td>
<td>Internet</td>
<td>110</td>
<td>-46</td>
</tr>
<tr>
<td>83.</td>
<td>Complaining</td>
<td>108</td>
<td>-9</td>
</tr>
<tr>
<td>84.</td>
<td>Breathing</td>
<td>103</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>85.</td>
<td>Takeout</td>
<td>98</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>86.</td>
<td>Beef</td>
<td>98</td>
<td>-8</td>
</tr>
<tr>
<td>87.</td>
<td>People</td>
<td>96</td>
<td>+11</td>
</tr>
<tr>
<td>88.</td>
<td>New Year&#8217;s resolutions</td>
<td>96</td>
<td>+1</td>
</tr>
<tr>
<td>89.</td>
<td>Him</td>
<td>94</td>
<td>-5</td>
</tr>
<tr>
<td>90.</td>
<td>Tiger blood</td>
<td>92</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>91.</td>
<td>Makeup</td>
<td>91</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>92.</td>
<td>Juice</td>
<td>90</td>
<td>-7</td>
</tr>
<tr>
<td>93.</td>
<td>Clothes</td>
<td>89</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>94.</td>
<td>My phone</td>
<td>88</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>95.</td>
<td>God</td>
<td>87</td>
<td>-15</td>
</tr>
<tr>
<td>96.</td>
<td>Abstinence</td>
<td>85</td>
<td>-15</td>
</tr>
<tr>
<td>97.</td>
<td>Stress</td>
<td>84</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>98.</td>
<td>Chipotle</td>
<td>82</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>99.</td>
<td>Myspace</td>
<td>81</td>
<td>-48</td>
</tr>
<tr>
<td>100.</td>
<td>Eating out</td>
<td>81</td>
<td>-25</td>
</tr>
</table>
<p>Image created using <a href="http://www.wordle.net/">Wordle</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.openbible.info/blog/2011/03/what-twitterers-are-giving-up-for-lent-2011-edition/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Presentation on Tweeting the Bible</title>
		<link>http://www.openbible.info/blog/2010/03/presentation-on-tweeting-the-bible/</link>
		<comments>http://www.openbible.info/blog/2010/03/presentation-on-tweeting-the-bible/#comments</comments>
		<pubDate>Sat, 27 Mar 2010 00:53:49 +0000</pubDate>
		<dc:creator>openbible</dc:creator>
				<category><![CDATA[Bible]]></category>
		<category><![CDATA[Twitter]]></category>
		<category><![CDATA[Visualizations]]></category>

		<guid isPermaLink="false">http://www.openbible.info/blog/?p=195</guid>
		<description><![CDATA[Here&#8217;s a presentation I just gave at the BibleTech 2010 conference about how people tweet the Bible: Bible Tech 2010 Tweeting the Bible View more presentations from openbibleinfo. Also: PowerPoint, PDF. I distributed the following handout at the presentation, showing the popularity of Bible chapters and verses cited on Twitter. It displays a lot of [...]]]></description>
			<content:encoded><![CDATA[<p>Here&#8217;s a presentation I just gave at the <a href="http://www.bibletechconference.com/">BibleTech</a> 2010 conference about how people <a href="http://www.openbible.info/realtime/">tweet the Bible</a>:</p>
<div style="width:425px" id="__ss_3568513"><strong style="display:block;margin:12px 0 4px"><a href="http://www.slideshare.net/openbibleinfo/bible-tech-2010-tweeting-the-bible" title="Bible Tech 2010 Tweeting the Bible">Bible Tech 2010 Tweeting the Bible</a></strong><object width="425" height="355"><param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=bibletech2010-stephensmith-tweetingthebible-100326192548-phpapp02&#038;rel=0&#038;stripped_title=bible-tech-2010-tweeting-the-bible" /><param name="allowFullScreen" value="true"/><param name="allowScriptAccess" value="always"/><embed src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=bibletech2010-stephensmith-tweetingthebible-100326192548-phpapp02&#038;rel=0&#038;stripped_title=bible-tech-2010-tweeting-the-bible" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"></embed></object>
<div style="padding:5px 0 12px">View more <a href="http://www.slideshare.net/">presentations</a> from <a href="http://www.slideshare.net/openbibleinfo">openbibleinfo</a>.</div>
</div>
<p>Also: <a href="http://a.openbible.info/blog/2010-03-bibletech.pptx">PowerPoint</a>, <a href="http://a.openbible.info/blog/2010-03-bibletech.pdf">PDF</a>.</p>
<p>I distributed the following handout at the presentation, showing the popularity of Bible chapters and verses cited on Twitter. It displays a lot of data: darker chapters are more popular, the number in the middle of each box is the most popular verse in the chapter, and sparklines in each box show the distribution of the popularity in each chapter. (Genesis 1:1 is by far the most popular verse in Genesis 1, while Genesis 3:15 is only a little more popular than other verses in the chapter.)</p>
<p><a href="http://a.openbible.info/blog/2010-03-bibletech-big.png"><img src="http://a.openbible.info/blog/2010-03-bibletech-small.png" width="500" height="371" alt="The grid shows the popularity of chapters and verses in the Bible as cited on Twitter." /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.openbible.info/blog/2010/03/presentation-on-tweeting-the-bible/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Delving into Lent Data</title>
		<link>http://www.openbible.info/blog/2010/03/delving-into-lent-data/</link>
		<comments>http://www.openbible.info/blog/2010/03/delving-into-lent-data/#comments</comments>
		<pubDate>Sun, 07 Mar 2010 21:38:33 +0000</pubDate>
		<dc:creator>openbible</dc:creator>
				<category><![CDATA[Twitter]]></category>
		<category><![CDATA[Visualizations]]></category>

		<guid isPermaLink="false">http://www.openbible.info/blog/?p=187</guid>
		<description><![CDATA[Let’s look a little more at some of the data on what Twitterers are giving up for Lent. Categories of Things Given up by Location As I only track in English what people are giving up, there are concentrations in English-speaking countries. Size indicates the relative number of Twitterers in each country giving up something [...]]]></description>
			<content:encoded><![CDATA[<p>Let’s look a little more at some of the data on what <a href="http://www.openbible.info/blog/2010/02/what-twitterers-are-giving-up-for-lent-2010-edition/">Twitterers are giving up for Lent</a>.</p>
<h3>Categories of Things Given up by Location</h3>
<p>As I only track in English what people are giving up, there are concentrations in English-speaking countries.</p>
<p><a href="http://a.openbible.info/blog/2010-03-country-big.png"><img src="http://a.openbible.info/blog/2010-03-country.png" width="800" height="422" alt="Categories by Country" /></a><br />
Size indicates the relative number of Twitterers in each country giving up something for Lent.</p>
<p><a href="http://a.openbible.info/blog/2010-03-location-big.png"><img src="http://a.openbible.info/blog/2010-03-location.png" width="800" height="396" alt="Categories by Location" /></a></p>
<h3>Categories of Things Given up by State</h3>
<p>These visualizations show the differences (or lack thereof) in what people are giving up among U.S. states.</p>
<p><a href="http://a.openbible.info/blog/2010-03-state-big.png"><img src="http://a.openbible.info/blog/2010-03-state.png" width="800" height="421" alt="Categories by State" /></a><br />
Size indicates the relative number of Twitterers in each state giving up something for Lent. Sorry, Alaska and Hawaii.</p>
<p><a href="http://a.openbible.info/blog/2010-03-state-bar-big.png"><img src="http://a.openbible.info/blog/2010-03-state-bar.png" width="800" height="505" alt="Categories by State (%)" /></a><br />
The composition of each state’s categories of tweets shows mostly minor variations among states. Some states (like Wyoming on the far right) have small numbers of tweets. I would have liked to use opacity or width to indicate this disparity but couldn&#8217;t figure out how to do it.</p>
<h3>Comparison between 2009 and 2010</h3>
<p>This treemap shows how the data changed between 2009 and 2010. The size of the box shows the number of people giving up each category and thing, while color indicates the percentage change from last year: dark blue indicates the steepest drop; dark orange indicates the steepest rise. The second chart shows the same data more conventionally expressed.</p>
<p><a href="http://a.openbible.info/blog/2010-03-treemap-big.png"><img src="http://a.openbible.info/blog/2010-03-treemap.png" width="800" height="328" alt="Categories and Terms: Term Changes: 2009-2010" /></a></p>
<p><img src="http://a.openbible.info/blog/2010-03-heatmap.png" width="675" height="1244" alt="Categories and Terms: Term Changes: 2009-2010" /></p>
<h3>About the Visualizations</h3>
<p>I created these charts mostly to explore how the new data-analysis software <a href="http://tableausoftware.com/public/">Tableau Public</a> works. One of its claims to fame is that you can publish interactive visualizations to the web, a feature I didn’t take advantage of here. Tableau doesn’t do treemaps, so I used <a href="http://manyeyes.alphaworks.ibm.com/manyeyes/">Many Eyes</a> to create the treemap; the closest Tableau equivalent appears below the treemap.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.openbible.info/blog/2010/03/delving-into-lent-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>What Twitterers Are Giving up for Lent (2010 Edition)</title>
		<link>http://www.openbible.info/blog/2010/02/what-twitterers-are-giving-up-for-lent-2010-edition/</link>
		<comments>http://www.openbible.info/blog/2010/02/what-twitterers-are-giving-up-for-lent-2010-edition/#comments</comments>
		<pubDate>Wed, 24 Feb 2010 00:27:05 +0000</pubDate>
		<dc:creator>openbible</dc:creator>
				<category><![CDATA[Twitter]]></category>
		<category><![CDATA[Visualizations]]></category>

		<guid isPermaLink="false">http://www.openbible.info/blog/?p=171</guid>
		<description><![CDATA[Snow makes the list this year, understandable given the Snowpocalypse and Snowmageddon that gripped much of the Eastern U.S. in the weeks preceding Ash Wednesday. IPods also made the list after the Bishop of Liverpool asked people to consider praying instead of listening to them. This year a celebrity, Justin Bieber, cracks the top 100. [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://a.openbible.info/blog/2010-02-lent-big.png"><img src="http://a.openbible.info/blog/2010-02-lent.png" alt="The top 100 things that Twitterers are giving up for Lent in 2010." width="800" height="408" /></a></p>
<p>Snow makes the list this year, understandable given the <a href="http://en.wikipedia.org/wiki/Snowpocalypse">Snowpocalypse and Snowmageddon</a> that gripped much of the Eastern U.S. in the weeks preceding Ash Wednesday. IPods also made the list after the Bishop of Liverpool asked people to consider <a href="http://news.bbc.co.uk/local/liverpool/hi/people_and_places/religion_and_ethics/newsid_8520000/8520453.stm">praying instead of listening</a> to them. This year a celebrity, Justin Bieber, cracks the top 100. He beat out the Jonas Brothers, 64 votes to 11; draw your own conclusions.</p>
<p>The list largely tracks <a href="http://www.openbible.info/blog/2009/02/top-100-things-twitterers-are-giving-up-for-lent/">last year&#8217;s list</a>. It draws from 40,000 tweets retrieved February 14-20, 2010.</p>
<h3>Complete List of the Top 100</h3>
<table class="data">
<tr>
<th>Rank</th>
<th>Word</th>
<th>Count</th>
<th>Change from last year&#8217;s rank</th>
</tr>
<tr>
<td>1.</td>
<td>Twitter</td>
<td>2089</td>
<td>+1</td>
</tr>
<tr>
<td>2.</td>
<td>Facebook</td>
<td>1874</td>
<td>-1</td>
</tr>
<tr>
<td>3.</td>
<td>Chocolate</td>
<td>1323</td>
<td>0</td>
</tr>
<tr>
<td>4.</td>
<td>Alcohol</td>
<td>1258</td>
<td>+1</td>
</tr>
<tr>
<td>5.</td>
<td>Swearing</td>
<td>1158</td>
<td>+5</td>
</tr>
<tr>
<td>6.</td>
<td>Soda</td>
<td>1126</td>
<td>0</td>
</tr>
<tr>
<td>7.</td>
<td>Lent</td>
<td>792</td>
<td>-3</td>
</tr>
<tr>
<td>8.</td>
<td>Meat</td>
<td>720</td>
<td>0</td>
</tr>
<tr>
<td>9.</td>
<td>Sex</td>
<td>701</td>
<td>+7</td>
</tr>
<tr>
<td>10.</td>
<td>Fast food</td>
<td>695</td>
<td>+7</td>
</tr>
<tr>
<td>11.</td>
<td>Sweets</td>
<td>627</td>
<td>0</td>
</tr>
<tr>
<td>12.</td>
<td>Coffee</td>
<td>445</td>
<td>-5</td>
</tr>
<tr>
<td>13.</td>
<td>iPod</td>
<td>437</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>14.</td>
<td>Candy</td>
<td>325</td>
<td>+18</td>
</tr>
<tr>
<td>15.</td>
<td>Religion</td>
<td>305</td>
<td>-6</td>
</tr>
<tr>
<td>16.</td>
<td>Catholicism</td>
<td>264</td>
<td>-4</td>
</tr>
<tr>
<td>17.</td>
<td>Smoking</td>
<td>254</td>
<td>+5</td>
</tr>
<tr>
<td>18.</td>
<td>Junk food</td>
<td>251</td>
<td>+34</td>
</tr>
<tr>
<td>19.</td>
<td>Giving up things</td>
<td>241</td>
<td>-6</td>
</tr>
<tr>
<td>20.</td>
<td>Beer</td>
<td>241</td>
<td>-5</td>
</tr>
<tr>
<td>21.</td>
<td>Chips</td>
<td>234</td>
<td>+24</td>
</tr>
<tr>
<td>22.</td>
<td>You</td>
<td>233</td>
<td>+13</td>
</tr>
<tr>
<td>23.</td>
<td>Stuff</td>
<td>217</td>
<td>-3</td>
</tr>
<tr>
<td>24.</td>
<td>Fried food</td>
<td>199</td>
<td>+33</td>
</tr>
<tr>
<td>25.</td>
<td>Red meat</td>
<td>193</td>
<td>+19</td>
</tr>
<tr>
<td>26.</td>
<td>Bread</td>
<td>187</td>
<td>+13</td>
</tr>
<tr>
<td>27.</td>
<td>Sugar</td>
<td>183</td>
<td>-8</td>
</tr>
<tr>
<td>28.</td>
<td>Work</td>
<td>176</td>
<td>-14</td>
</tr>
<tr>
<td>29.</td>
<td>Shopping</td>
<td>174</td>
<td>+11</td>
</tr>
<tr>
<td>30.</td>
<td>Food</td>
<td>162</td>
<td>-7</td>
</tr>
<tr>
<td>31.</td>
<td>Shame</td>
<td>150</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>32.</td>
<td>Social networking</td>
<td>147</td>
<td>-2</td>
</tr>
<tr>
<td>33.</td>
<td>Caffeine</td>
<td>136</td>
<td>-6</td>
</tr>
<tr>
<td>34.</td>
<td>Rice</td>
<td>136</td>
<td>+44</td>
</tr>
<tr>
<td>35.</td>
<td>Procrastination</td>
<td>127</td>
<td>-11</td>
</tr>
<tr>
<td>36.</td>
<td>Internet</td>
<td>126</td>
<td>-11</td>
</tr>
<tr>
<td>37.</td>
<td>Cheese</td>
<td>120</td>
<td>+1</td>
</tr>
<tr>
<td>38.</td>
<td>Coke</td>
<td>120</td>
<td>+41</td>
</tr>
<tr>
<td>39.</td>
<td>Starbucks</td>
<td>119</td>
<td>+14</td>
</tr>
<tr>
<td>40.</td>
<td>School</td>
<td>118</td>
<td>+36</td>
</tr>
<tr>
<td>41.</td>
<td>Ice cream</td>
<td>118</td>
<td>+13</td>
</tr>
<tr>
<td>42.</td>
<td>Booze</td>
<td>117</td>
<td>-21</td>
</tr>
<tr>
<td>43.</td>
<td>Texting</td>
<td>114</td>
<td>+28</td>
</tr>
<tr>
<td>44.</td>
<td>Masturbation</td>
<td>111</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>45.</td>
<td>Cookies</td>
<td>110</td>
<td>+11</td>
</tr>
<tr>
<td>46.</td>
<td>TV</td>
<td>97</td>
<td>-18</td>
</tr>
<tr>
<td>47.</td>
<td>Christianity</td>
<td>96</td>
<td>0</td>
</tr>
<tr>
<td>48.</td>
<td>Snow</td>
<td>96</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>49.</td>
<td>Wine</td>
<td>92</td>
<td>-13</td>
</tr>
<tr>
<td>50.</td>
<td>Pizza</td>
<td>91</td>
<td>+12</td>
</tr>
<tr>
<td>51.</td>
<td>MySpace</td>
<td>91</td>
<td>+4</td>
</tr>
<tr>
<td>52.</td>
<td>Men</td>
<td>90</td>
<td>+31</td>
</tr>
<tr>
<td>53.</td>
<td>Giving up</td>
<td>89</td>
<td>-19</td>
</tr>
<tr>
<td>54.</td>
<td>Sobriety</td>
<td>89</td>
<td>-13</td>
</tr>
<tr>
<td>55.</td>
<td>Liquor</td>
<td>87</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>56.</td>
<td>Desserts</td>
<td>87</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>57.</td>
<td>Lint</td>
<td>87</td>
<td>-20</td>
</tr>
<tr>
<td>58.</td>
<td>Pancakes</td>
<td>82</td>
<td>-29</td>
</tr>
<tr>
<td>59.</td>
<td>Homework</td>
<td>81</td>
<td>+28</td>
</tr>
<tr>
<td>60.</td>
<td>Marijuana</td>
<td>80</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>61.</td>
<td>Diet Coke</td>
<td>80</td>
<td>-28</td>
</tr>
<tr>
<td>62.</td>
<td>Hope</td>
<td>78</td>
<td>+15</td>
</tr>
<tr>
<td>63.</td>
<td>Virginity</td>
<td>76</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>64.</td>
<td>French fries</td>
<td>75</td>
<td>-15</td>
</tr>
<tr>
<td>65.</td>
<td>Laziness</td>
<td>71</td>
<td>+5</td>
</tr>
<tr>
<td>66.</td>
<td>Boys</td>
<td>67</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>67.</td>
<td>Nothing</td>
<td>67</td>
<td>-19</td>
</tr>
<tr>
<td>68.</td>
<td>Carbs</td>
<td>66</td>
<td>-4</td>
</tr>
<tr>
<td>69.</td>
<td>Justin Bieber</td>
<td>64</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>70.</td>
<td>Pork</td>
<td>64</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>71.</td>
<td>Porn</td>
<td>63</td>
<td>+9</td>
</tr>
<tr>
<td>72.</td>
<td>Me</td>
<td>62</td>
<td>0</td>
</tr>
<tr>
<td>73.</td>
<td>Sleep</td>
<td>61</td>
<td>-42</td>
</tr>
<tr>
<td>74.</td>
<td>Complaining</td>
<td>58</td>
<td>-16</td>
</tr>
<tr>
<td>75.</td>
<td>Eating out</td>
<td>58</td>
<td>-8</td>
</tr>
<tr>
<td>76.</td>
<td>Jesus</td>
<td>55</td>
<td>-26</td>
</tr>
<tr>
<td>77.</td>
<td>McDonald&#8217;s</td>
<td>55</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>78.</td>
<td>Beef</td>
<td>54</td>
<td>+18</td>
</tr>
<tr>
<td>79.</td>
<td>Church</td>
<td>54</td>
<td>+6</td>
</tr>
<tr>
<td>80.</td>
<td>God</td>
<td>53</td>
<td>-21</td>
</tr>
<tr>
<td>81.</td>
<td>Abstinence</td>
<td>53</td>
<td>-39</td>
</tr>
<tr>
<td>82.</td>
<td>Cake</td>
<td>52</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>83.</td>
<td>Negativity</td>
<td>52</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>84.</td>
<td>Him</td>
<td>49</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>85.</td>
<td>Juice</td>
<td>47</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>86.</td>
<td>Celibacy</td>
<td>44</td>
<td>+13</td>
</tr>
<tr>
<td>87.</td>
<td>Chicken</td>
<td>42</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>88.</td>
<td>Lying</td>
<td>42</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>89.</td>
<td>New Year&#8217;s resolutions</td>
<td>42</td>
<td>-29</td>
</tr>
<tr>
<td>90.</td>
<td>Sarcasm</td>
<td>42</td>
<td>-39</td>
</tr>
<tr>
<td>91.</td>
<td>Snacking</td>
<td>41</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>92.</td>
<td>My wife</td>
<td>39</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>93.</td>
<td>Tea</td>
<td>37</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>94.</td>
<td>iPhone</td>
<td>37</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>95.</td>
<td>Exercise</td>
<td>36</td>
<td>-6</td>
</tr>
<tr>
<td>96.</td>
<td>Sweet tea</td>
<td>35</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>97.</td>
<td>People</td>
<td>35</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>98.</td>
<td>Vegetables</td>
<td>34</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>99.</td>
<td>Pasta</td>
<td>33</td>
<td>&nbsp;</td>
</tr>
<tr>
<td>100.</td>
<td>Self control</td>
<td>33</td>
<td>&nbsp;</td>
</tr>
</table>
<p>Image created using <a href="http://www.wordle.net/">Wordle</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.openbible.info/blog/2010/02/what-twitterers-are-giving-up-for-lent-2010-edition/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>New Feature: Search for Bible Verses on Twitter</title>
		<link>http://www.openbible.info/blog/2009/11/new-feature-search-for-bible-verses-on-twitter/</link>
		<comments>http://www.openbible.info/blog/2009/11/new-feature-search-for-bible-verses-on-twitter/#comments</comments>
		<pubDate>Tue, 01 Dec 2009 02:54:15 +0000</pubDate>
		<dc:creator>openbible</dc:creator>
				<category><![CDATA[Twitter]]></category>

		<guid isPermaLink="false">http://www.openbible.info/blog/?p=150</guid>
		<description><![CDATA[Search over 1.2 million Bible verses on Twitter&#8211;nearly every tweet that has mentioned a Bible verse since April 2009. You can also see a list of the most popular verses on Twitter over the past few hours (“Trending Verses”). This project uses several APIs from Twitter and is still in a beta stage. It could [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.openbible.info/realtime/">Search over 1.2 million Bible verses on Twitter</a>&#8211;nearly every tweet that has mentioned a Bible verse since April 2009. You can also see a list of the most popular verses on Twitter over the past few hours (“Trending Verses”).</p>
<p><a href="http://www.openbible.info/realtime/"><img src="http://a.openbible.info/blog/2009-11-realtime.png" width="500" height="472" alt="Search Bible verses on Twitter." /></a></p>
<p>This project uses several APIs from Twitter and is still in a beta stage. It could evolve in several directions, but I want to see how people use it before developing it further.</p>
<p>It&#8217;s not quite realtime, but the most recent tweet is rarely more than a few minutes old.</p>
<p>Behind the scenes, it <a href="http://www.openbible.info/blog/2009/10/top-100-linguistic-indicators-of-bible-related-tweets/">processes tweets</a> to try to ensure their relevance; it has about a 92% accuracy rate based on a training corpus of around 45,000 tweets. Use the &#8220;relevant&#8221; and &#8220;not relevant&#8221; buttons in the interface if you see a tweet that you think should or shouldn&#8217;t belong. (I&#8217;m mostly interested in the latter, but it seems weird not to have both&#8211;like Facebook&#8217;s <a href="http://news.cnet.com/8301-27076_3-10391043-248.html">lack of an unlike button</a>.)</p>
<p>It currently uses <a href="http://www.logos.com/reftagger">Logos RefTagger</a> to link the Bible references in the tweets.</p>
<p>Feel free to leave a comment here if you have a feature idea or want to make any suggestions.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.openbible.info/blog/2009/11/new-feature-search-for-bible-verses-on-twitter/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Top 100 Linguistic Indicators of Bible-Related Tweets</title>
		<link>http://www.openbible.info/blog/2009/10/top-100-linguistic-indicators-of-bible-related-tweets/</link>
		<comments>http://www.openbible.info/blog/2009/10/top-100-linguistic-indicators-of-bible-related-tweets/#comments</comments>
		<pubDate>Mon, 26 Oct 2009 02:52:31 +0000</pubDate>
		<dc:creator>openbible</dc:creator>
				<category><![CDATA[Collective Intelligence]]></category>
		<category><![CDATA[Twitter]]></category>

		<guid isPermaLink="false">http://www.openbible.info/blog/?p=135</guid>
		<description><![CDATA[When people tweet about Bible verses on Twitter, what words do they use? Here are the top 100: bible lord christ gospel psalm god psalms corinthians preach shall heaven readings church spirit righteous verse lectionary verses spiritual ministry pray enemies thou tongue creation wisdom deuteronomy testament strength refuge therefore kingdom romans holy thankful thy reading [...]]]></description>
			<content:encoded><![CDATA[<p>When people tweet about Bible verses on Twitter, what words do they use? Here are the top 100:</p>
<ol>
<li>bible</li>
<li>lord</li>
<li>christ</li>
<li>gospel</li>
<li>psalm</li>
<li>god</li>
<li>psalms</li>
<li>corinthians</li>
<li>preach</li>
<li>shall</li>
<li>heaven</li>
<li>readings</li>
<li>church</li>
<li>spirit</li>
<li>righteous</li>
<li>verse</li>
<li>lectionary</li>
<li>verses</li>
<li>spiritual</li>
<li>ministry</li>
<li>pray</li>
<li>enemies</li>
<li>thou</li>
<li>tongue</li>
<li>creation</li>
<li>wisdom</li>
<li>deuteronomy</li>
<li>testament</li>
<li>strength</li>
<li>refuge</li>
<li>therefore</li>
<li>kingdom</li>
<li>romans</li>
<li>holy</li>
<li>thankful</li>
<li>thy</li>
<li>reading</li>
<li>rejoice</li>
<li>understanding</li>
<li>faithful</li>
<li>message</li>
<li>earth</li>
<li>blessed</li>
<li>exodus</li>
<li>deut</li>
<li>faith</li>
<li>wise</li>
<li>beginning</li>
<li>pastor</li>
<li>chapel</li>
<li>chapter</li>
<li>survey</li>
<li>anger</li>
<li>resurrection</li>
<li>risen</li>
<li>read</li>
<li>hearts</li>
<li>chronicles</li>
<li>salvation</li>
<li>flesh</li>
<li>servant</li>
<li>glory</li>
<li>praying</li>
<li>kings</li>
<li>sheep</li>
<li>praise</li>
<li>trust</li>
<li>prosperity</li>
<li>bless</li>
<li>heavens</li>
<li>deeds</li>
<li>toward</li>
<li>discussion</li>
<li>whoever</li>
<li>speaks</li>
<li>ye</li>
<li>hath</li>
<li>amen</li>
<li>teaching</li>
<li>thess</li>
<li>apostles</li>
<li>preparing</li>
<li>eph</li>
<li>eccl</li>
<li>path</li>
<li>fear</li>
<li>upon</li>
<li>presence</li>
<li>inspire</li>
<li>search</li>
<li>zechariah</li>
<li>seek</li>
<li>teach</li>
<li>wrath</li>
<li>commandments</li>
<li>believers</li>
<li>humility</li>
<li>spoke</li>
<li>thee</li>
<li>devo</li>
</ol>
<h3>Background</h3>
<p>Extracting Bible references from text means identifying whether a given piece of text is referring to a Bible verse or something else. For example, the meaning of <em>Acts 2</em> depends on context:</p>
<ul>
<li><strong>Referring to Bible passage:</strong> <em>Acts 2</em> recounts the early church.</li>
<li><strong>Not referring to Bible passage:</strong> She’s 5 years old but <em>acts 2</em>.</li>
</ul>
<p>When you encounter a phrase that could be a Bible reference, you have to look at context to determine whether the phrase is a Bible reference. Humans can make this leap pretty easily, but computers need rigorous models and lots of training data to guess whether an ambiguous phrase is a Bible reference. In the above example, the phrase &#8220;early church&#8221; is a strong indicator that the phrase &#8220;Acts 2&#8243; is a Bible reference, while the phrase &#8220;years old&#8221; is an indicator the other way.</p>
<p>Twitter, with its high volume of content and decent search engine, provides lots of training data.</p>
<h3>Methodology</h3>
<p>Using the <a href="http://apiwiki.twitter.com/Twitter-Search-API-Method%3A-search">Twitter Search API</a>, I downloaded 30,000 tweets possibly containing Bible references (e.g., [john 3], [jeremiah 29]) and then categorized them by hand as referring to a Bible verse or not.</p>
<p>I then ran a <a href="http://en.wikipedia.org/wiki/Naive_Bayes_classifier">Naive Bayes algorithm</a> on the resulting tweets to produce the above list, which contains the words that most strongly indicate the presence of a Bible reference.</p>
<p>This list suffers from sample bias, of course: a different set of tweets would produce a different list. In addition, the list is Twitter-centric; the results may not carry over into blogs or other media. (People substitute the number “2” for the word “to” and “4” for “for” on Twitter more frequently than they do elsewhere, for example, which oversamples content like “I’m meeting Matthew 4 dinner.”)</p>
<h3>See It in Action</h3>
<p><a href="http://www.openbible.info/realtime/">Search for Bible references on Twitter</a>. Use the <em>relevant</em> and <em>not relevant</em> buttons to improve the filtering. I haven&#8217;t formally announced this new feature of OpenBible.info yet; consider the link a preview.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.openbible.info/blog/2009/10/top-100-linguistic-indicators-of-bible-related-tweets/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Top 100 Things Twitterers Are Giving Up for Lent</title>
		<link>http://www.openbible.info/blog/2009/02/top-100-things-twitterers-are-giving-up-for-lent/</link>
		<comments>http://www.openbible.info/blog/2009/02/top-100-things-twitterers-are-giving-up-for-lent/#comments</comments>
		<pubDate>Fri, 27 Feb 2009 12:50:14 +0000</pubDate>
		<dc:creator>openbible</dc:creator>
				<category><![CDATA[Twitter]]></category>
		<category><![CDATA[Visualizations]]></category>

		<guid isPermaLink="false">http://www.openbible.info/blog/?p=61</guid>
		<description><![CDATA[Some you’d expect (alcohol, chocolate), some are ironic (giving up Lent for Lent, giving up giving up things), some are odd (pants, lint), some are anti-religious (religion, Catholicism), and some are tech-related (Facebook, Twitter—even “Facebook and Twitter” makes the list). Complete List Facebook (654) Twitter (317) Chocolate (272) Lent (216) Alcohol (187) Soda (139) Coffee [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://a.openbible.info/blog/2009-02-lent-big.png"><img src="http://a.openbible.info/blog/2009-02-lent.png" alt="A Wordle of the below words shows the relative frequency of each one." width="800" height="516" /></a></p>
<p>Some you’d expect (alcohol, chocolate), some are ironic (giving up Lent for Lent, giving up giving up things), some are odd (pants, lint), some are anti-religious (religion, Catholicism), and some are tech-related (Facebook, Twitter—even “Facebook and Twitter” makes the list).</p>
<h3>Complete List</h3>
<ol>
<li>Facebook (654)</li>
<li>Twitter (317)</li>
<li>Chocolate (272)</li>
<li>Lent (216)</li>
<li>Alcohol (187)</li>
<li>Soda (139)</li>
<li>Coffee (129)</li>
<li>Meat (126)</li>
<li>Religion (102)</li>
<li>Swearing (94)</li>
<li>Sweets (92)</li>
<li>Catholicism (90)</li>
<li>Giving up things (80)</li>
<li>Work (70)</li>
<li>Beer (60)</li>
<li>Sex (59)</li>
<li>Fast food (57)</li>
<li>Facebook and twitter (57)</li>
<li>Sugar (45)</li>
<li>Stuff (43)</li>
<li>Booze (41)</li>
<li>Smoking (39)</li>
<li>Food (39)</li>
<li>Procrastination (38)</li>
<li>Internet (37)</li>
<li>Cursing (36)</li>
<li>Caffeine (35)</li>
<li>TV (33)</li>
<li>Pancakes (33)</li>
<li>Social networking (33)</li>
<li>Sleep (32)</li>
<li>Candy (32)</li>
<li>Diet Coke (29)</li>
<li>Giving up (29)</li>
<li>You (28)</li>
<li>Wine (28)</li>
<li>Lint (28)</li>
<li>Cheese (28)</li>
<li>Bread (26)</li>
<li>Shopping (26)</li>
<li>Sobriety (26)</li>
<li>Abstinence (24)</li>
<li>Cussing (24)</li>
<li>Red meat (24)</li>
<li>Chips (23)</li>
<li>Internet porn (22)</li>
<li>Christianity (22)</li>
<li>Nothing (21)</li>
<li>French fries (21)</li>
<li>Jesus (21)</li>
<li>Sarcasm (19)</li>
<li>Junk food (19)</li>
<li>Starbucks (18)</li>
<li>Ice cream (18)</li>
<li>MySpace (18)</li>
<li>Cookies (18)</li>
<li>Fried food (17)</li>
<li>Complaining (17)</li>
<li>God (16)</li>
<li>New years resolutions (15)</li>
<li>Social media (15)</li>
<li>Pizza (14)</li>
<li>Tweeting (14)</li>
<li>Carbs (13)</li>
<li>MySpace and Facebook (13)</li>
<li>Carbon (13)</li>
<li>Eating out (13)</li>
<li>Stress (13)</li>
<li>Flaky guys (12)</li>
<li>Laziness (12)</li>
<li>Texting (12)</li>
<li>Me (11)</li>
<li>Some of your money (11)</li>
<li>Annoying me (11)</li>
<li>Sacrifice (11)</li>
<li>School (11)</li>
<li>Hope (10)</li>
<li>Rice (10)</li>
<li>Coke (10)</li>
<li>Porn (10)</li>
<li>The snooze button (10)</li>
<li>Guilt (10)</li>
<li>Men (9)</li>
<li>Obama (9)</li>
<li>Church (9)</li>
<li>My job (9)</li>
<li>Homework (9)</li>
<li>Self denial (9)</li>
<li>Moderation (9)</li>
<li>Exercise (8)</li>
<li>Bacon (8)</li>
<li>Dieting (8)</li>
<li>Paying taxes (8)</li>
<li>Dr Pepper (8)</li>
<li>Gossip (8)</li>
<li>Beef (8)</li>
<li>Pants (7)</li>
<li>My sanity (7)</li>
<li>Celibacy (7)</li>
<li>Shaving (7)</li>
</ol>
<h3>About</h3>
<p>Created using the <a href="http://apiwiki.twitter.com/Search+API+Documentation">Twitter Search API</a> and <a href="http://www.wordle.net/">Wordle</a>. Data based on analysis of 15,000 tweets from February 22-26, 2009.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.openbible.info/blog/2009/02/top-100-things-twitterers-are-giving-up-for-lent/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
	</channel>
</rss>

