VeggiePharm: American Gut and uBiome Compared

Wednesday, December 3, 2014

American Gut and uBiome Compared

In July this year, I conducted an experiment. I sent identical fecal samples to American Gut and uBiome to see how they would compare. There have been several discussions on this subject, seeming to make the two rival microbiome test labs flawed in their methods. An explanation from uBiome.

One argument was that even on an individual turd, there would be differences in the microbes found on it's surface. Makes sense.

Another argument was that the normal sampling method, ie. wiping a cotton swab on used toilet paper, would also lead to different microbes being detected. Sounds right.

I did things a bit differently. I did my "#2" in a new, food grade, plastic bag. Kneaded it thoroughly. Then I touched the exact same spot lightly with the swabs provided by the two companies.

I now have the results!

Unfortunately, the two companies provide completely different looking reports, so it is impossible to hold them up side-by-side and look for inconsistencies. What I did instead, was go line by line on each report and any time I found an exact same genus listed by name, I wrote down the results to compare. But first here is a bar chart I crafted to give an overall impression:

I am pretty happy with the phyla numbers. Better than the other comparisons I saw.

Here are the genus level comparisons:

Genus	AmGut %	uBiome %
Bacteroides	27	10
Fecalibacterium	21	17
Bifidobacterium	4.8	8.8
Coprococcus	2.8	.96
Lachnospira	2	1.8
Paraprevotella	1.5	1.5
Roseburia	1.3	14.7
Blautia	.7	3.5
Ruminococcus	.58	3.52
Clostridium	.4	.6
Sutterella	.38	.28
Desulfovibrio	.35	1.55
Dorea	.24	.76
Akkermansia	.19	.12
Odoribacter	.07	.064
Slackia	.04	.08

Major discrpancies in yellow

Strange. Some really big discrepancies and also some eerie similarities. I'd say overall they did a pretty good job, but when they were off...they were WAY off.

Overall, I'm impressed with the comparison. It's possible that the major discrepancies were due to sampling differences. And anyway, this is all just for fun. I am glad that at least it seems that both companies found pretty much the same microbes, and in the same relative abundances. I feel good recommending either company, but would caution against head-to-head comparisons.

Obviously it makes a direct comparison between AmGut and uBiome, like I did here, pointless. And if anyone was looking at this chart to make grand statements, you were wrong and may want to correct them.

Gut Microbe (Genus Level)	Real Food (uBiome) My Results	uBiome Average Results	Potato Starch Added (AmGut) My Results
F. Prausnitzi	17.2%	9.3%	4.8%
Roseburia	14.7%	3.4%	.41%
Bacteroides	10.2%	9.4%	not shown at this level
Bifidobacteria	8.81%	.88%	11.32%
Blautia	3.52%	7.7%	.76%
Ruminococcus	3.2%	6.06%	13%
Eubacterium	.8%	.9%	.1%
Akkermansia	.12%	1.2%	.07%
Prevotella	.001%	7.36%	.0001%

If there are any biotech geeks out there, the uBiome raw data can be found here: uBiome.txt

And the AmGut Genus list can be found here: AmGut Genus.xls

If anyone knows how to get the American Gut raw data, please drop me a comment or email. I found a link that says it's available through EBI. Has anyone tried to figure it out yet?

The raw data can be fetched from the European Bioinformatics Institute. EBI is part of The International Nucleotide Sequence Database Collaboration and is a public warehouse for sequence data. The deposited American Gut Project accessions so far are:

ERP003819

ERP003822

ERP003820

ERP003821

ERP005367

ERP005366

ERP005361

ERP005362

Processed sequence data and open-access descriptions of the bioinformatic processing can be found at our Github repository.
Sequencing of American Gut samples is an on-going project, as are the bioinformatic analyses. These resources will be updated as more information is added and as more open-access descriptions are finalized.

27 comments:

AnonymousDecember 3, 2014 at 4:29 PM
Tim,
This is brilliant work. It is hugely important because it lets us know that much of the speculation that is based on these tests is so far probably pretty worthless. First, I wonder how many samples being reported (and used in “studies”) were homogenized in the way you did – I suspect a minuscule number – and therefore the results may be entirely random and bear no relationship to the actual proportions of the different phyla or genera. Secondly, the substantive differences between these two sets of results based on the single sample (which you highlighted in yellow) means that at least one of these analyses is wrong (and perhaps both are). Finally, even if homogenous samples were being used, and the analysis was accurate, we have no idea if it is the relative proportions of these bacteria in our microbiome that matter – we have seen a huge amount of speculation based on the percentage of one phylum compared to another, or one genus to another, when it might be the absolute quantities that are significant. Those of us who have experimented with supplementation with different fibres, resistant starches or other polysaccharides, are very aware that stool volume can increase magnificently. The total size of our microbiome maybe much more important than the relative proportions of individual bacteria. I feel that as yet we know almost next to nothing for certain about any of this – we are only just getting a tiny sense of how important it might all be. Please keep up the good work.
ReplyDelete
Replies
Tim SteeleDecember 3, 2014 at 6:15 PM
I was just reading a great article that Gemma sent today on some of the problems with microbiota science. three-voice debate about gut microbiota research

One issue I concur with is the overuse of animal models for human microbe studies, and another is that there seems to be a rush on to get gut articles published, possibly leading to shoddy conclusions.

I think it's up to us to really get this all figured out. We need to keep trying different things and reporting what we are finding. I'm in this for the long run!

ReplyDelete
Replies
AnonymousDecember 4, 2014 at 7:09 AM
What are your thoughts on Dr BG's comments regrading RS2 suppressing bifido based on the numbers you are seeing with your results? I know you eat a wide range of RS fiber, but it would seem your numbers are still quite good when using supplemental PS.
ReplyDelete
Replies
Tim SteeleDecember 4, 2014 at 7:52 AM
Just finishing up a post comparing a high PS diet and zero PS diet and two AmGut samples. Basically, what I see, is that potato starch creates huge growth of Bifido, of course, Dr. BG will counter with "it's the wrong type of Bifido," but I can see no basis for that.

Hopefully will be up in just a little bit, just need to check spelling and format.
ReplyDelete
Replies
AnonymousDecember 4, 2014 at 11:55 AM
http://vegetablepharm.blogspot.co.uk/2014/12/american-gut-and-ubiome-compared.html?utm_source=feedburner&utm_medium=email&utm_campaign=Feed:+Vegetablepharm+%28VegetablePharm%29
ReplyDelete
Replies
AnonymousDecember 4, 2014 at 11:56 AM
Sorry, copy and paste fail there. What I meant to say was:

"Kneaded it thoroughly." - Love this blog. I was thinking of doing something similar myself, though I really don't buy the claim that these two companies should be so far apart. I suspect one of them - though it could easily be both - are doing something wrong.
ReplyDelete
Replies
AnonymousDecember 5, 2014 at 7:01 AM
Wouldn't surprise me in the least that if you submitted the same sample to the same lab under two different aliases you would probably get back varied results. Would be an interesting test of the capabilities of the lab & procedures.
ReplyDelete
Replies
AnonymousJanuary 11, 2015 at 6:02 PM
Is there a way to interpret the results on your own? Say, you have too much "bad" bacteria showing up in the report and need to rebalance? I really believe my gut issues led to Hashimoto's Thyroiditis, but I've never had my gut checked.
ReplyDelete
Replies
dnvrdaveJanuary 27, 2015 at 8:32 AM
Hey Tim,

Great blog! I’m glad I found it! I have results from 3 uBiome samples and 5 American Gut samples. I’ve been writing Python scripts to try to reproduce their results, just for fun and education. I’m also in the Coursersa Bioinformatics class, but more for reference than to do the homework (I work full time too). So far, I have found that uBiome gives us about 20 times as many sequences per sample as American Gut (900,000 vs 40,000), but at first glance, the American Gut samples look more consistent.

I got my uBiome raw data right off the web site. I don’t know if everyone can do that, since I chose a donation level that included data analysis, I think.

I followed the American Gut project notebook, as you did, to look for my raw sample data (a tedious process!), and I found only one of 5 samples. But when I sent them an email, they sent me links to the other 4 samples. They are very helpful, so you might just ask them. I’m using the Greengenes taxonomy data from May 2013, as the notebook says, but I have questions about that. Each OTU (e.g. specific genus or species) has up to 64,000 sequences in the fasta file, and I want only 1 “truth” sequence, to compare my sequences against. Here’s a sample of my output, showing that Greengeens gives us 2900 unique OTUs:
2900 unique OTUs written to COUNT PCT
p__Actinobacteria; c__Actinobacteria; o__Actinomycetales; f__Corynebacteriaceae; g__Corynebacterium; s__ 64385 5.098
p__Firmicutes; c__Bacilli; o__Lactobacillales; f__Streptococcaceae; g__Streptococcus; s__ 59282 4.694

I’m interested in Akkermansia muciniphila because my level is much higher than most people. It’s the bacterium that has been shown to posibly contribute to obesity, when your levels are too low. Greengenes gives 1535 sequences for this species, and I copy/pasted a couple of them to BLAST, comparing to their known Akkermansia muciniphila genome. The longest sequence matched the entire 16s rRNA gene sequence perfectly! So I’m hoping I can just use the longest sequence from each of the 2900 unique OTUs in Greengenes as my “truth” catalog to identify all my sequences.

By the way, I was shocked to find that my two earliest American Gut samples (April and June 2013) have completely different results now on the web page than they originally did. My guess is that they weren’t using the May 2013 Greengenes catalog yet in their original processing.

My latest experiment is to compare a vegan sample to a meat/dairy sample from American Gut. But it will probably be months before they finish processing. I haven’t even taken the meat/dairy sample yet. By the way, American Gut has completely moved to UCSD.
ReplyDelete
Replies
Tim SteeleJanuary 27, 2015 at 9:14 AM
Ha! A fellow gut-geek!

Amazing how many tools available to the average schmuck willing to learn to use them. BLAST is just the tip of the iceberg. Send me an email and I'll send you copies of the lecture slides from last semester with links to about a dozen or more such tools used for comparative analysis.

I had fun plugging all the raw data into MG-rast and playing with all the tools there.

My problem is finding the time required to fully get into it. You really need to keep on top of what you send in as some things take several days to compute.

I love that AmGut and uBiome are so cheap and, from what I see, accurate. I think we are closing in on identifying some key species and markers for gut health. With all of the results they are collecting, they can develop micro-arrays and an app to decipher, bringing us one step closer to a real-time, home-use tool for examining our gut flora.

Hey, if you ever feel like writing up your experiments you I can give you author rights to post them as a blog-post here. It might be a nice reference for future gut sleuths, and give you a place to store your data for easy reference.

Thanks for the note!
ReplyDelete
Replies
dnvrdaveJanuary 29, 2015 at 9:08 PM
Tim,

Did you end up getting all your AmGut raw data? And your uBiome data? Do you know why uBiome uses 20 times as many samples? Have you ever done or seen a study that shows the bacteria types that are most similar, i.e. the ones whose 16s rRNA sequences get confused for each other most easily. Maybe that would explain the uBiome/AmGut discrepancies you show above. I'm going to do my own comparison, just for fun.

By the way, AmGut told me I wouldn't be able to directly compare their sequences to uBiome because they look at a different piece of the 16s gene. I found that they overlap by 85% for Akkermansia muciniphila (23 nucleotides apart out of 150). I don't think we need to directly compare them anyway, to get meaningful results. But maybe the discrepancies you see are due to where they are looking on the gene. I might start with yours and see what I find, e.g. I'll try to see where AmGut and uBiome were looking on the Roseburia 16s gene (will they also be 85% overlapping?) and look for other OTUs that are similar (esp to AmGut, since yours are "missing"). Do you know of tools that already do this? Either way, I want to build my own. Also by the way, my Roseburia ranged from .9% to 1.8% (two uBiome samples), with two AmGut at 1.2% and 1.4%, so it looks like my hypotheses will fail (i.e. they compare fine for me).
ReplyDelete
Replies
dnvrdaveFebruary 4, 2015 at 7:27 PM
I have an MG-Rast account but haven't run any data yet. This poster has some negative comments on it. I've been told to use QIIME instead. I'm only learning now that identification of sequences is complicated and imperfect. I'll have to work hard in that next Bioinformatics class to do more on my own without falling into too many traps.
http://i.imgur.com/Up4mGEE.png
ReplyDelete
Replies
Tim SteeleMay 8, 2015 at 7:01 AM
Happy Mother's Day! Your mom gave you the first important seed of your microbiome, so why not thank her with a loving gift? From now until Sunday, all uBiome kits are 50% off with the discount code THANKSMOM.

http://ubiome.us6.list-manage.com/track/click?u=0f3df5f7888317bb5a6634929&id=70c1de7274&e=1cec30584c
ReplyDelete
Replies
AnonymousJuly 1, 2015 at 9:31 PM
Many thanks for your blog, Tim, so help so much people with your informations!
The quantity of bacteria (count) of the Ubiome test: can we relate this number to the full number of bacteria (100 trillions), that we should have in the gut?
ReplyDelete
Replies
AnonymousJuly 8, 2015 at 12:34 PM
Thank you for your answer. Richard Sprague writes, that "count" is the actual number of organisms found in the sample, and "count_norm" means a "normalized" version of the count, which we can think as a percentage.
It looks to me, that "count" is a number, and "count norm" is, when divided to 10000, a percentage.
When I summary all numbers of "count", then my result is 1365000.
ReplyDelete
Replies

Add comment

Pages

Wednesday, December 3, 2014

American Gut and uBiome Compared

27 comments: