{"id":1183,"date":"2008-04-08T20:48:42","date_gmt":"2008-04-09T00:48:42","guid":{"rendered":"https:\/\/dwan.org\/?p=1183"},"modified":"2019-10-26T20:49:52","modified_gmt":"2019-10-27T00:49:52","slug":"23andme-data","status":"publish","type":"post","link":"https:\/\/dwan.org\/index.php\/2008\/04\/08\/23andme-data\/","title":{"rendered":"23andMe Data"},"content":{"rendered":"\n<p>Just to get this out of the way: <a href=\"http:\/\/23andme.com\">23andme<\/a>. There. Google has been fed Welcome, news-hungry bloggers with constantly running searches.<\/p>\n\n\n\n<p>I finally got my data yesterday, a full 11 weeks after my tube of spit was received. My <a href=\"http:\/\/fdmts.livejournal.com\/442704.html\">whining<\/a> was <a href=\"http:\/\/www.genetic-future.com\/2008\/03\/23andme-laboratory-delays.html\">noticed,<\/a> and eventually there was some <a href=\"http:\/\/spittoon.23andme.com\/2008\/04\/01\/sorry-for-the-delay-a-message-from-23andme-founders-anne-wojcicki-and-linda-avey\/\">explanation<\/a> on the company blog. In response, there was <a href=\"http:\/\/www.genetic-future.com\/2008\/04\/23andme-delays-explained.html\">sympathetic commentary<\/a> as well as some <a href=\"http:\/\/thegenesherpa.blogspot.com\/2008\/04\/clia-whats-that.html\">outright snarkiness<\/a> in response. Sheesh, <a href=\"http:\/\/www.thegenesherpa.blogspot.com\/\">Sherpa.<\/a> We all know you\u2019re smart. Relax.<\/p>\n\n\n\n<p>In any event, I say \u201cwhatever.\u201d Business is hard. I\u2019ve finally got my data. Let\u2019s play.<\/p>\n\n\n\n<p>I deliberately held off on judging the web interface until now. Their example genotypes are loaded with somewhat interesting stuff. How would it look on data from an average height, average build, no major complaints white guy? My first thought was that 23 and me has done a remarkable amount of work to make the basic biology comprehensible to the interested layperson. I have no idea whether <a href=\"http:\/\/www.decodeme.com\/\">Decodeme<\/a> and <a href=\"http:\/\/navigenics.com\/\">Navigenics<\/a> have done similar flash and java whizziness. Perhaps unfortunately for me, I\u2019ve had a bunch of biology classes and have been doing bioinformatics for a living for more than a couple of years. Their tutorials are old hat, and while the miraculous wonder of DNA replication is still pretty newsworthy, but that\u2019s not why I bought a ticket for this particuar ride.<\/p>\n\n\n\n<p>If you don\u2019t already know how DNA, SNPs, and Microarrays work, it\u2019s a good intro.<\/p>\n\n\n\n<p>My first impression of looking at the <a href=\"https:\/\/www.23andme.com\/you\/journal\/\">Gene Journal<\/a> was <strong>\u201cthis is it?\u201d<\/strong> I <a href=\"http:\/\/fdmts.livejournal.com\/431121.html\">knew up front<\/a> that we know next to nothing about how these data points relate to physical predispositions, aptitudes, and diseases. Still, I was bummed when I saw only 58 entries in the gene journal. These are the detailed, annotated pages that 23 and me provides about specific physical conditions. I did get a snicker out of the fact that one of the subsections of the journal is named <a href=\"https:\/\/www.23andme.com\/you\/journal\/?offset=0&limit=20&sort_column=title&sort_direction=asc&old_query=&old_body_part=all&view=all&query=&body_part=nether\">nether regions<\/a>.<\/p>\n\n\n\n<p>Since I was already giggling, I dug into the <a href=\"https:\/\/www.23andme.com\/you\/journal\/prostate\/overview\/\">Prostate Cancer<\/a> page, and saw sort of what I was expecting to see. A nice little write up on what prostate cancer is and what it does, and then a couple of charts comparing my relevant SNP with the population baselines. They even do a slick little thing with the set of five markers that are currently implicated in this condition.<\/p>\n\n\n\n<p>As I clicked through, the data was absolutely inconclusive \u2026 as expected. A percent higher here, a percent lower there. The very best research that they cite has at least two studies, each of which has at least 1,000 participants. That seems large until you start dealing in fractional percentages, at which point you\u2019ve got barely enough data to say anything at all.<\/p>\n\n\n\n<p>Next, I noticed what was missing. The stone cold indicators currently used in genetic testing for Alzheimer\u2019s, Huntington\u2019s, and so on are simply not present in the gene journal. The Breast Cancer indicators are in there, but there are only <a href=\"https:\/\/www.23andme.com\/you\/journal\/breastcancer\/overview\/\">two of them<\/a>. Contrast with the <a href=\"http:\/\/www.snpedia.com\/index.php?title=Breast_cancer\">SNPedia entry<\/a> where an ungodly list is present.<\/p>\n\n\n\n<p><a href=\"http:\/\/snpedia.com\">SNPedia<\/a> is run by my friend and colleague <a href=\"http:\/\/cariaso.livejournal.com\/\">cariaso<\/a>. The vast majority of annotations in there are the product of his evenings and weekends. Let me say that again: One or two people, in the evenings, have presented a far more comprehensive resource than a team of venture capital funded biologists. Now, the SNPedia AJAX and java whizziness is not nearly on par with 23 and me, but that\u2019s not what I\u2019m here for.<\/p>\n\n\n\n<p>To continue, <a href=\"http:\/\/cariaso.livejournal.com\/\">cariaso<\/a> has put forward links from SNPedia into 23 and me\u2019s page. I can very easily take some genes that are implicated according to SNPedia, go into my raw data from 23 and me, and bounce back to see what that genotype might mean. In fact, I can do the math that 23 and me might have done and calculate a more comprehensive set of odds than they did. Turns out I\u2019m at a slightly elevated risk for breast cancer. I\u2019m also a guy. Take <em>that<\/em> statistics!<\/p>\n\n\n\n<p>Back to Alzheimer\u2019s. That\u2019s also a simple one. I\u2019ve got the \u201cless likely to get Alzheimer\u2019s\u201d version of <a href=\"http:\/\/www.snpedia.com\/index.php?title=ApoE\">APoE<\/a>. However, it took a good 10 minutes of clicking back and forth between the two sites. <a href=\"https:\/\/www.23andme.com\/you\/explorer\/gene\/?gene_name=apoe\">The 23 and me page about APOE<\/a> tells me less than nothing. Further, there isn\u2019t even a gene journal entry for this largely genetically determined and very well researched disease.<\/p>\n\n\n\n<p>I can see a reason for this: If I was in the business of selling genotyping, I would have nightmares about the first time a customer killed himself when he learned about some propensity or condition. The same sort of standards that apply to revealing HIV test results might well apply here. As a society and as individuals we will need to spend some hard time thinking about how much we want to know and when we want to know it.<\/p>\n\n\n\n<p>So they\u2019ve deliberately (it seems) scrubbed the gene journal of easy and clear evidence for simply diagnosable risk factors. Thanks guys. I can download the raw data and do the work myself \u2026 but a summary page eludes me. That\u2019s the useful stuff that I was looking for. I know that I have a \u201c58% chance\u201d of having brown eyes, and that my earwax is wet. Further, I knew that my dad\u2019s fathers came from Ireland, and that I\u2019m Northern European on both sides.<\/p>\n\n\n\n<p>Me? I want to know it all, and I want to know it now. Therefore, I <a href=\"https:\/\/www.23andme.com\/you\/download\/\">downloaded my data<\/a> into a 4.5MB text file. First look:<\/p>\n\n\n\n<p><code><br>\n<strong>Total Lines<\/strong>:  576119<br>\n<strong>Comments<\/strong>:  14 lines<br>\n<strong>SNP data points<\/strong>:  576105<br>\n<strong>Uncalled (--)<\/strong>:  3189 (0.5%)<\/code><\/p>\n\n\n\n<p>So that seems fair to me. Half a percent \u201cno call\u201d rate is actually pretty darn good. Note that this says nothing about <em>error<\/em> rate, nor about quality of the called SNPs. It does say a bit about where they set their quality thresholds. I\u2019ll dig into that on a second look.<\/p>\n\n\n\n<p>Then I used my colleague <a href=\"http:\/\/cariaso.livejournal.com\/\">cariaso<\/a>\u2018s <a href=\"http:\/\/www.snpedia.com\/index.php?title=Promethease\">Promethease<\/a> to churn through an analysis. This is a program that does exactly what I was doing by hand. For each data point, dig into SNPedia, figure out whether it\u2019s associated with any diseases, and if I have the rare version. Then it produces a report listing my information both by disease and by rarity. That\u2019s what I was looking for. He and I had a nice chat about usage, being trapped on Windows, and about data privacy.<\/p>\n\n\n\n<p>Even with my shiny promethease report, there\u2019s still not much information in there. The state of human genetics is \u201cjust getting started.\u201d However, I plan to run Promethease once a month or so as a way to keep up on the advances in science relevant to <em>me<\/em>. Hopefully in a few years there will be more to read about.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Just to get this out of the way: 23andme. There. Google has been fed Welcome, news-hungry bloggers with constantly running searches. I finally got my data yesterday, a full 11 weeks after my tube of spit was received. My whining was noticed, and eventually there&hellip;<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[37],"tags":[],"class_list":["post-1183","post","type-post","status-publish","format-standard","hentry","category-genomics"],"_links":{"self":[{"href":"https:\/\/dwan.org\/index.php\/wp-json\/wp\/v2\/posts\/1183","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dwan.org\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/dwan.org\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/dwan.org\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/dwan.org\/index.php\/wp-json\/wp\/v2\/comments?post=1183"}],"version-history":[{"count":1,"href":"https:\/\/dwan.org\/index.php\/wp-json\/wp\/v2\/posts\/1183\/revisions"}],"predecessor-version":[{"id":1184,"href":"https:\/\/dwan.org\/index.php\/wp-json\/wp\/v2\/posts\/1183\/revisions\/1184"}],"wp:attachment":[{"href":"https:\/\/dwan.org\/index.php\/wp-json\/wp\/v2\/media?parent=1183"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/dwan.org\/index.php\/wp-json\/wp\/v2\/categories?post=1183"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/dwan.org\/index.php\/wp-json\/wp\/v2\/tags?post=1183"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}