<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
		<id>http://stab.st-andrews.ac.uk/wiki/index.php?action=history&amp;feed=atom&amp;title=Meryl</id>
		<title>Meryl - Revision history</title>
		<link rel="self" type="application/atom+xml" href="http://stab.st-andrews.ac.uk/wiki/index.php?action=history&amp;feed=atom&amp;title=Meryl"/>
		<link rel="alternate" type="text/html" href="http://stab.st-andrews.ac.uk/wiki/index.php?title=Meryl&amp;action=history"/>
		<updated>2026-04-30T06:09:06Z</updated>
		<subtitle>Revision history for this page on the wiki</subtitle>
		<generator>MediaWiki 1.30.0</generator>

	<entry>
		<id>http://stab.st-andrews.ac.uk/wiki/index.php?title=Meryl&amp;diff=1829&amp;oldid=prev</id>
		<title>Rf at 16:16, 4 August 2017</title>
		<link rel="alternate" type="text/html" href="http://stab.st-andrews.ac.uk/wiki/index.php?title=Meryl&amp;diff=1829&amp;oldid=prev"/>
				<updated>2017-08-04T16:16:57Z</updated>
		
		<summary type="html">&lt;p&gt;&lt;/p&gt;
&lt;table class=&quot;diff diff-contentalign-left&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr style=&quot;vertical-align: top;&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: white; color:black; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: white; color:black; text-align: center;&quot;&gt;Revision as of 16:16, 4 August 2017&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l4&quot; &gt;Line 4:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 4:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&#039;diff-marker&#039;&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&#039;diff-marker&#039;&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;It&amp;#039;s by Brian Walenz of Kanu (MinION genome assembly fame).&lt;/div&gt;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;It&amp;#039;s by Brian Walenz of Kanu (MinION genome assembly fame).&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&amp;#160;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt;+&lt;/td&gt;&lt;td style=&quot;color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&amp;#160;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt;+&lt;/td&gt;&lt;td style=&quot;color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;= Simple example =&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&amp;#160;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt;+&lt;/td&gt;&lt;td style=&quot;color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&amp;#160;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt;+&lt;/td&gt;&lt;td style=&quot;color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;Typically, it&amp;#039;s a two stage process. A database of kmers must be built. This is the &amp;#039;&amp;#039;&amp;#039;-B&amp;#039;&amp;#039;&amp;#039; option below:&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&amp;#160;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt;+&lt;/td&gt;&lt;td style=&quot;color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&amp;#160;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt;+&lt;/td&gt;&lt;td style=&quot;color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt; meryl -B -C -m 22 -s /data/references/ecolik12.fasta -o ecoli-22mers&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&amp;#160;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt;+&lt;/td&gt;&lt;td style=&quot;color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&amp;#160;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt;+&lt;/td&gt;&lt;td style=&quot;color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&amp;#039;&amp;#039;&amp;#039;-C&amp;#039;&amp;#039;&amp;#039; is for canonical, which is kmers on both forward(&amp;#039;&amp;#039;&amp;#039;-f&amp;#039;&amp;#039;&amp;#039;) and (&amp;#039;&amp;#039;&amp;#039;-r&amp;#039;&amp;#039;&amp;#039;) strands.&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&amp;#160;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt;+&lt;/td&gt;&lt;td style=&quot;color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&amp;#160;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt;+&lt;/td&gt;&lt;td style=&quot;color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;Then the second stage would be as follows:&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&amp;#160;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt;+&lt;/td&gt;&lt;td style=&quot;color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&amp;#160;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt;+&lt;/td&gt;&lt;td style=&quot;color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt; meryl -Dh -s ecoli-22mers &amp;gt; ecoli-22mers.fasta&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&#039;diff-marker&#039;&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&#039;diff-marker&#039;&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;= Help file =&lt;/div&gt;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;= Help file =&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;/table&gt;</summary>
		<author><name>Rf</name></author>	</entry>

	<entry>
		<id>http://stab.st-andrews.ac.uk/wiki/index.php?title=Meryl&amp;diff=1828&amp;oldid=prev</id>
		<title>Rf: Created page with &quot;= Introduction =  This is a kmer coounting program, part of a larger suite of tools at kmer.sourceforge.net  It&#039;s by Brian Walenz of Kanu (MinION genome assembly fame).  = Hel...&quot;</title>
		<link rel="alternate" type="text/html" href="http://stab.st-andrews.ac.uk/wiki/index.php?title=Meryl&amp;diff=1828&amp;oldid=prev"/>
				<updated>2017-08-04T16:07:13Z</updated>
		
		<summary type="html">&lt;p&gt;Created page with &amp;quot;= Introduction =  This is a kmer coounting program, part of a larger suite of tools at kmer.sourceforge.net  It&amp;#039;s by Brian Walenz of Kanu (MinION genome assembly fame).  = Hel...&amp;quot;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;= Introduction =&lt;br /&gt;
&lt;br /&gt;
This is a kmer coounting program, part of a larger suite of tools at kmer.sourceforge.net&lt;br /&gt;
&lt;br /&gt;
It&amp;#039;s by Brian Walenz of Kanu (MinION genome assembly fame).&lt;br /&gt;
&lt;br /&gt;
= Help file =&lt;br /&gt;
&lt;br /&gt;
It usually overflows on the screen, so here it is in full:&lt;br /&gt;
&lt;br /&gt;
 usage: meryl [personality] [global options] [options]&lt;br /&gt;
 &lt;br /&gt;
 where personality is:&lt;br /&gt;
         -P -- compute parameters&lt;br /&gt;
         -B -- build table&lt;br /&gt;
         -S -- scan table&lt;br /&gt;
         -M -- &amp;quot;math&amp;quot; operations&lt;br /&gt;
         -D -- dump table&lt;br /&gt;
 &lt;br /&gt;
 -P:  Given a sequence file (-s) or an upper limit on the&lt;br /&gt;
      number of mers in the file (-n), compute the table size&lt;br /&gt;
      (-t in build) to minimize the memory usage.&lt;br /&gt;
         -m #          (size of a mer; required)&lt;br /&gt;
         -c #          (homopolymer compression; optional)&lt;br /&gt;
         -p            (enable positions)&lt;br /&gt;
         -s seq.fasta  (seq.fasta is scanned to determine the number of mers)&lt;br /&gt;
         -n #          (compute params assuming file with this many mers in it)&lt;br /&gt;
 &lt;br /&gt;
      Only one of -s, -n need to be specified.  If both are given&lt;br /&gt;
      -s takes priority.&lt;br /&gt;
 &lt;br /&gt;
 &lt;br /&gt;
 -B:  Given a sequence file (-s) and lots of parameters, compute&lt;br /&gt;
      the mer-count tables.  By default, both strands are processed.&lt;br /&gt;
         -f            (only build for the forward strand)&lt;br /&gt;
         -r            (only build for the reverse strand)&lt;br /&gt;
         -C            (use canonical mers, assumes both strands)&lt;br /&gt;
         -L #          (DON&amp;#039;T save mers that occur less than # times)&lt;br /&gt;
         -U #          (DON&amp;#039;T save mers that occur more than # times)&lt;br /&gt;
         -m #          (size of a mer; required)&lt;br /&gt;
         -c #          (homopolymer compression; optional)&lt;br /&gt;
         -p            (enable positions)&lt;br /&gt;
         -s seq.fasta  (sequence to build the table for)&lt;br /&gt;
         -o tblprefix  (output table prefix)&lt;br /&gt;
         -v            (entertain the user)&lt;br /&gt;
 &lt;br /&gt;
      By default, the computation is done as one large sequential process.&lt;br /&gt;
      Multi-threaded operation is possible, at additional memory expense, as&lt;br /&gt;
      is segmented operation, at additional I/O expense.&lt;br /&gt;
 &lt;br /&gt;
      Threaded operation: Split the counting in to n almost-equally sized&lt;br /&gt;
      pieces.  This uses an extra h MB (from -P) per thread.&lt;br /&gt;
         -threads n    (use n threads to build)&lt;br /&gt;
 &lt;br /&gt;
      Segmented, sequential operation: Split the counting into pieces that&lt;br /&gt;
      will fit into no more than m MB of memory, or into n equal sized pieces.&lt;br /&gt;
      Each piece is computed sequentially, and the results are merged at the end.&lt;br /&gt;
      Only one of -memory and -segments is needed.&lt;br /&gt;
         -memory mMB   (use at most m MB of memory per segment)&lt;br /&gt;
         -segments n   (use n segments)&lt;br /&gt;
 &lt;br /&gt;
      Segmented, batched operation: Same as sequential, except this allows&lt;br /&gt;
      each segment to be manually executed in parallel.&lt;br /&gt;
      Only one of -memory and -segments is needed.&lt;br /&gt;
         -memory mMB     (use at most m MB of memory per segment)&lt;br /&gt;
         -segments n     (use n segments)&lt;br /&gt;
         -configbatch    (create the batches)&lt;br /&gt;
         -countbatch n   (run batch number n)&lt;br /&gt;
         -mergebatch     (merge the batches)&lt;br /&gt;
      Initialize the compute with -configbatch, which needs all the build options.&lt;br /&gt;
      Execute all -countbatch jobs, then -mergebatch to complete.&lt;br /&gt;
        meryl -configbatch -B [options] -o file&lt;br /&gt;
        meryl -countbatch 0 -o file&lt;br /&gt;
        meryl -countbatch 1 -o file&lt;br /&gt;
        ...&lt;br /&gt;
        meryl -countbatch N -o file&lt;br /&gt;
        meryl -mergebatch N -o file&lt;br /&gt;
      Batched mode can run on the grid.&lt;br /&gt;
         -sge        jobname      unique job name for this execution.  Meryl will submit&lt;br /&gt;
                                  jobs with name mpjobname, ncjobname, nmjobname, for&lt;br /&gt;
                                  phases prepare, count and merge.&lt;br /&gt;
         -sgebuild &amp;quot;options&amp;quot;    any additional options to sge, e.g.,&lt;br /&gt;
         -sgemerge &amp;quot;options&amp;quot;    &amp;quot;-p -153 -pe thread 2 -A merylaccount&amp;quot;&lt;br /&gt;
                                  N.B. - -N will be ignored&lt;br /&gt;
                                  N.B. - be sure to quote the options&lt;br /&gt;
 &lt;br /&gt;
 -M:  Given a list of tables, perform a math, logical or threshold operation.&lt;br /&gt;
      Unless specified, all operations take any number of databases.&lt;br /&gt;
 &lt;br /&gt;
      Math operations are:&lt;br /&gt;
         min       count is the minimum count for all databases.  If the mer&lt;br /&gt;
                   does NOT exist in all databases, the mer has a zero count, and&lt;br /&gt;
                   is NOT in the output.&lt;br /&gt;
         minexist  count is the minimum count for all databases that contain the mer&lt;br /&gt;
         max       count is the maximum count for all databases&lt;br /&gt;
         add       count is sum of the counts for all databases&lt;br /&gt;
         sub       count is the first minus the second (binary only)&lt;br /&gt;
         abs       count is the absolute value of the first minus the second (binary only)&lt;br /&gt;
 &lt;br /&gt;
      Logical operations are:&lt;br /&gt;
         and       outputs mer iff it exists in all databases&lt;br /&gt;
         nand      outputs mer iff it exists in at least one, but not all, databases&lt;br /&gt;
         or        outputs mer iff it exists in at least one database&lt;br /&gt;
         xor       outputs mer iff it exists in an odd number of databases&lt;br /&gt;
 &lt;br /&gt;
      Threshold operations are:&lt;br /&gt;
         lessthan x            outputs mer iff it has count &amp;lt;  x&lt;br /&gt;
         lessthanorequal x     outputs mer iff it has count &amp;lt;= x&lt;br /&gt;
         greaterthan x         outputs mer iff it has count &amp;gt;  x&lt;br /&gt;
         greaterthanorequal x  outputs mer iff it has count &amp;gt;= x&lt;br /&gt;
         equal x               outputs mer iff it has count == x&lt;br /&gt;
      Threshold operations work on exactly one database.&lt;br /&gt;
 &lt;br /&gt;
         -s tblprefix  (use tblprefix as a database)&lt;br /&gt;
         -o tblprefix  (create this output)&lt;br /&gt;
         -v            (entertain the user)&lt;br /&gt;
 &lt;br /&gt;
      NOTE:  Multiple tables are specified with multiple -s switches; e.g.:&lt;br /&gt;
               meryl -M add -s 1 -s 2 -s 3 -s 4 -o all&lt;br /&gt;
      NOTE:  It is NOT possible to specify more than one operation:&lt;br /&gt;
               meryl -M add -s 1 -s 2 -sub -s 3&lt;br /&gt;
             will NOT work.&lt;br /&gt;
 &lt;br /&gt;
 &lt;br /&gt;
 -D:  Dump the table (not all of these work).&lt;br /&gt;
 &lt;br /&gt;
      -Dd        Dump a histogram of the distance between the same mers.&lt;br /&gt;
      -Dt        Dump mers &amp;gt;= a threshold.  Use -n to specify the threshold.&lt;br /&gt;
      -Dc        Count the number of mers, distinct mers and unique mers.&lt;br /&gt;
      -Dh        Dump (to stdout) a histogram of mer counts.&lt;br /&gt;
      -s         Read the count table from here (leave off the .mcdat or .mcidx).&lt;/div&gt;</summary>
		<author><name>Rf</name></author>	</entry>

	</feed>