<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/"><channel><title>Tuberculosis on Microbes made me do it</title><link>https://mbhall88.github.io/tags/tuberculosis/</link><description>Recent content in Tuberculosis on Microbes made me do it</description><generator>Hugo</generator><language>en-au</language><lastBuildDate>Wed, 21 Jun 2023 00:00:00 +0000</lastBuildDate><atom:link href="https://mbhall88.github.io/tags/tuberculosis/index.xml" rel="self" type="application/rss+xml"/><item><title>Searching for shared sequence between Mycobacterium tuberculosis and Homo sapiens</title><link>https://mbhall88.github.io/post/mtb-human-shared-sequence/</link><pubDate>Wed, 21 Jun 2023 00:00:00 +0000</pubDate><guid>https://mbhall88.github.io/post/mtb-human-shared-sequence/</guid><description>&lt;h1 id="table-of-contents"&gt;Table of Contents&lt;/h1&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="#motivation"&gt;Motivation&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#shared-k-mer-content"&gt;Shared &lt;em&gt;k&lt;/em&gt;-mer content&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#aligning-reads"&gt;Aligning reads&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#summary"&gt;Summary&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#references"&gt;References&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;a href="https://doi.org/10.5281/zenodo.8068147"&gt;&lt;img alt="DOI" loading="lazy" src="https://zenodo.org/badge/DOI/10.5281/zenodo.8068147.svg"&gt;&lt;/a&gt;&lt;/p&gt;
&lt;h1 id="motivation"&gt;Motivation&lt;/h1&gt;
&lt;p&gt;We are in the early stages of planning a &lt;em&gt;Mycobacterium tuberculosis&lt;/em&gt; (MTB) analysis pipeline for a research project in Papua New Guinea. We&amp;rsquo;ll be sequencing sputum samples with Oxford Nanopore Technologies (ONT) devices and were thinking of different ways of decontaminating the data - i.e. remove anything non-MTB. Sputum samples traditionally have a lot of host (human) reads and reads from a variety of bacteria. Traditionally the MTB component is quite small&lt;sup&gt;1&lt;/sup&gt;. One component of this pipeline will be to upload sequencing reads to a remote/cloud server, so any reduction in file size will make uploads faster. As human reads are not used in any analysis steps, and will need to be removed prior to making any data available, we thought we could simplify things by removing human data as the first step. Our idea was to align reads to the human genome and just remove anything that aligns. However, one concern with this approach was whether any MTB reads could be lost in the process. This effectively boils down to the question: &lt;strong&gt;Do &lt;em&gt;Mycobacterium tuberculosis&lt;/em&gt; and &lt;em&gt;Homo sapiens&lt;/em&gt; share genomic sequence&lt;/strong&gt;? After a literature search, I was unable to find an answer - which seemed quite surprising. My suspicion is that most people just assume they do not. (Or my literature searching skills are poor.) So let&amp;rsquo;s take a look.&lt;/p&gt;</description></item></channel></rss>