<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
		<id>http://stab.st-andrews.ac.uk/wiki/index.php?action=history&amp;feed=atom&amp;title=Cegma</id>
		<title>Cegma - Revision history</title>
		<link rel="self" type="application/atom+xml" href="http://stab.st-andrews.ac.uk/wiki/index.php?action=history&amp;feed=atom&amp;title=Cegma"/>
		<link rel="alternate" type="text/html" href="http://stab.st-andrews.ac.uk/wiki/index.php?title=Cegma&amp;action=history"/>
		<updated>2026-05-12T20:35:31Z</updated>
		<subtitle>Revision history for this page on the wiki</subtitle>
		<generator>MediaWiki 1.30.0</generator>

	<entry>
		<id>http://stab.st-andrews.ac.uk/wiki/index.php?title=Cegma&amp;diff=490&amp;oldid=prev</id>
		<title>Rf at 16:13, 21 July 2016</title>
		<link rel="alternate" type="text/html" href="http://stab.st-andrews.ac.uk/wiki/index.php?title=Cegma&amp;diff=490&amp;oldid=prev"/>
				<updated>2016-07-21T16:13:00Z</updated>
		
		<summary type="html">&lt;p&gt;&lt;/p&gt;
&lt;table class=&quot;diff diff-contentalign-left&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr style=&quot;vertical-align: top;&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: white; color:black; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: white; color:black; text-align: center;&quot;&gt;Revision as of 16:13, 21 July 2016&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l1&quot; &gt;Line 1:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 1:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&#039;diff-marker&#039;&gt;−&lt;/td&gt;&lt;td style=&quot;color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&lt;/del&gt;&lt;/div&gt;&lt;/td&gt;&lt;td colspan=&quot;2&quot;&gt;&amp;#160;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&#039;diff-marker&#039;&gt;−&lt;/td&gt;&lt;td style=&quot;color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&lt;/del&gt;&lt;/div&gt;&lt;/td&gt;&lt;td colspan=&quot;2&quot;&gt;&amp;#160;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&#039;diff-marker&#039;&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;= How to use Cegma=&lt;/div&gt;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;= How to use Cegma=&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&#039;diff-marker&#039;&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;/td&gt;&lt;/tr&gt;

&lt;!-- diff cache key wiki-bwik:diff:version:1.11a:oldid:489:newid:490 --&gt;
&lt;/table&gt;</summary>
		<author><name>Rf</name></author>	</entry>

	<entry>
		<id>http://stab.st-andrews.ac.uk/wiki/index.php?title=Cegma&amp;diff=489&amp;oldid=prev</id>
		<title>Rf: Created page with &quot;  = How to use Cegma=  By default, Cegma expects that you will only analyse one fasta sequence each time. So it has certain standard names for its output files which &#039;&#039;&#039;do not...&quot;</title>
		<link rel="alternate" type="text/html" href="http://stab.st-andrews.ac.uk/wiki/index.php?title=Cegma&amp;diff=489&amp;oldid=prev"/>
				<updated>2016-07-21T16:11:37Z</updated>
		
		<summary type="html">&lt;p&gt;Created page with &amp;quot;  = How to use Cegma=  By default, Cegma expects that you will only analyse one fasta sequence each time. So it has certain standard names for its output files which &amp;#039;&amp;#039;&amp;#039;do not...&amp;quot;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;&lt;br /&gt;
&lt;br /&gt;
= How to use Cegma=&lt;br /&gt;
&lt;br /&gt;
By default, Cegma expects that you will only analyse one fasta sequence each time. So it has certain standard names for its output files which &amp;#039;&amp;#039;&amp;#039;do not&amp;#039;&amp;#039;&amp;#039; reflect the name of the input file at all.&lt;br /&gt;
&lt;br /&gt;
This means that if you have like-named files in the current directory, they will be overwritten. This is nasty behaviour but unfortunately something one must get used to in bioinformatic program development.&lt;br /&gt;
&lt;br /&gt;
However, this behaviour can be changed by way of a command-line option, &amp;#039;&amp;#039;&amp;#039;-o&amp;#039;&amp;#039;&amp;#039;, whereby a special prefix directory (given by the string following the &amp;#039;&amp;#039;&amp;#039;-o&amp;#039;&amp;#039;&amp;#039; option) is first created and all output files are deposited there. Therefore, one recommended way of running cegma is&lt;br /&gt;
&lt;br /&gt;
 cegma -o &amp;lt;myprefix&amp;gt; -g &amp;lt;myinputfile&amp;gt; -T 38&lt;br /&gt;
&lt;br /&gt;
So the output files will go into a directory with name &amp;lt;myprefix&amp;gt;, -g means genome, as in input genome, and finally -t is the number of threads. The threads will only produce parallelism among the cores of 1 computer, and even then, it is only used in parts of the cegma pipeline, such as hmmer&lt;br /&gt;
&lt;br /&gt;
Also, you should look out for three special environmental variables that, in the first case, genewise and in the second and third, cegma will require.&lt;br /&gt;
&lt;br /&gt;
  export WISECONFIGDIR=/share/apps/src/wise2.4.1/wisecfg&lt;br /&gt;
  export CEGMA=/share/apps/src/wise2.4.1/wisecfg&lt;br /&gt;
  export PERL5LIB=$PERL5LIB:$CEGMA/lib&lt;br /&gt;
&lt;br /&gt;
Here is an example jobscript for running on the cluster:&lt;br /&gt;
&lt;br /&gt;
 #!/bin/bash&lt;br /&gt;
 #&lt;br /&gt;
 #$ -cwd&lt;br /&gt;
 #$ -j y&lt;br /&gt;
 #$ -S /bin/bash&lt;br /&gt;
 #$ -V&lt;br /&gt;
 #$ -v PATH&lt;br /&gt;
 #$ -v LD_LIBRARY_PATH&lt;br /&gt;
 #$ -q noinfband.q,hp.q&lt;br /&gt;
 #$ -l highPriority&lt;br /&gt;
 echo &amp;quot;Node being used is ${HOSTNAME//+([[:alpha:]-.])}&amp;quot;&lt;br /&gt;
 module load cegma&lt;br /&gt;
 echo $CEGMA&lt;br /&gt;
 echo $PERL5LIB&lt;br /&gt;
 cegma -g $1 -T 20&lt;br /&gt;
&lt;br /&gt;
Another run to take care of output file names was as follows:&lt;br /&gt;
&lt;br /&gt;
  cegma -o rep${1%.*} -g $1 -T 20&lt;br /&gt;
&lt;br /&gt;
Oringally, the above script said -T 38, but this seems too high a number of threads, we have no single machine with that many.&lt;br /&gt;
&lt;br /&gt;
If you get tblastn errors, it&amp;#039;s because you do not have /share/apps/bin in your path. Even if you do have a proper $PATH, tblastn can continue to give errors. A very critical one is whereby tblastn and makeblastdb try to simply read the version of libz, but that libz doesn&amp;#039;t appear to repond to this request. Bear in mind that cegma was only tested on the 2.25 version of tblastn.&lt;br /&gt;
&lt;br /&gt;
=Installation=&lt;br /&gt;
&lt;br /&gt;
Genewise and geneid are requirements for cegma. And overall, too, genewise and cegma need to have glib. Glib is available on the &amp;#039;&amp;#039;&amp;#039;rpmforge repo&amp;#039;&amp;#039;&amp;#039;, so it needs to be enabled. To enable it:&lt;br /&gt;
 rrhy command=&amp;quot;sed -i &amp;#039;s/enabled = 0/enabled = 1/&amp;#039; /etc/yum.repos.d/rpmforge.repo&amp;quot;&lt;br /&gt;
&lt;br /&gt;
Then, for all the nodes, it needs to be installed using yum:&lt;br /&gt;
&lt;br /&gt;
 yum install glib.x86_64 glib-devel.x86_64&lt;br /&gt;
&lt;br /&gt;
Be aware that there are some very similarly named libraries such as the version 2: &amp;#039;&amp;#039;&amp;#039;glib2&amp;#039;&amp;#039;&amp;#039; and &amp;#039;&amp;#039;&amp;#039;glibc&amp;#039;&amp;#039;&amp;#039;. &lt;br /&gt;
&lt;br /&gt;
== Genewise ==&lt;br /&gt;
&lt;br /&gt;
Compilation is hampered by what appears to be, poor testing on Linux in favour of full testing on Mac OS X. Here are some preparatory steps:&lt;br /&gt;
&lt;br /&gt;
* make clean&lt;br /&gt;
* perl -p -i -e &amp;#039;s/getline/getline2/g&amp;#039; HMMer2/sqio.c&lt;br /&gt;
* perl -p -i -e &amp;#039;s/isnumber/isdigit/g&amp;#039; models/phasemodel.c (version 2.2.23 no tiene este fichero)&lt;br /&gt;
&lt;br /&gt;
Some gotchas&lt;br /&gt;
* &amp;quot;make[1]: execvp: csh: Permission denied&amp;quot;: means you don&amp;#039;t have tcsh, the Berkeley C shell.&lt;br /&gt;
&lt;br /&gt;
Then you can go ahead and type &amp;#039;&amp;#039;&amp;#039;make all&amp;#039;&amp;#039;&amp;#039;. When this is complete, you can also test, thogh you do need to set and environment variable for it as follows:&lt;br /&gt;
 export WISECONFIGDIR=/share/apps/src/wise2.4.1/wisecfg&lt;br /&gt;
&lt;br /&gt;
You still could get possible errors from the program &amp;#039;&amp;#039;&amp;#039;dyc&amp;#039;&amp;#039;&amp;#039; not being available. glib, as mentioned before is also an issue with genewise. To find the package that the library binary belongs to, try&lt;br /&gt;
 rpm -qf /usr/lib64/glib-1.2.so.0&lt;br /&gt;
&lt;br /&gt;
And then to query the individual nodes:&lt;br /&gt;
 rrhy &amp;#039;rpm -q glib&amp;#039;&lt;br /&gt;
to see if they have it installed.&lt;br /&gt;
&lt;br /&gt;
On ubuntu, the confusion increases, it would seem that version 1and 2 are all in one. compilation of genewise gives compalints with &amp;#039;&amp;#039;&amp;#039;glib.h&amp;#039;&amp;#039;&amp;#039; not being found, although it&amp;#039;s usually  just a directory (some glib-2.0) inside /usr/include. Though on the cluster, the glib-config execuatable exists, this is not true on ubuntu. An early error is the glib.h error, but using the following instead of glib-config will get you past that.&lt;br /&gt;
&lt;br /&gt;
 pkg-config --cflags glib-2.0&lt;br /&gt;
&lt;br /&gt;
In actual fact, to compile glib c code which uses the hash tables, you need a compile and link line like this:&lt;br /&gt;
 &lt;br /&gt;
 gcc -Wall gha0.c -I/usr/include/glib-2.0 -I/usr/include/glib-2.0/glib -I/usr/lib/x86_64-linux-gnu/glib-2.0/include -L/usr/lib/x86_64-linux-gnu/lib -lglib-2.0&lt;br /&gt;
&lt;br /&gt;
AS you can see, ubuntu has created this extra subdirectory &amp;#039;&amp;#039;&amp;#039;x86_64-linux-gnu&amp;#039;&amp;#039;&amp;#039; in which to hide things.&lt;br /&gt;
&lt;br /&gt;
However there are even more problems. Genewise clearly was not built first on ubuntu: Here&amp;#039;s a link &lt;br /&gt;
http://www.langebio.cinvestav.mx/bioinformatica/jacob/?p=709&lt;br /&gt;
The binaries are as follows:&lt;br /&gt;
# scanwise_server&lt;br /&gt;
# scanwise&lt;br /&gt;
# pswdb&lt;br /&gt;
# psw&lt;br /&gt;
# promoterwise&lt;br /&gt;
# genewisedb&lt;br /&gt;
# genewise&lt;br /&gt;
# estwisedb&lt;br /&gt;
# estwise&lt;br /&gt;
# dnal&lt;br /&gt;
# dba&lt;br /&gt;
&lt;br /&gt;
== Cegma ==&lt;br /&gt;
&lt;br /&gt;
Get the package&lt;br /&gt;
 wget http://korflab.ucdavis.edu/datasets/cegma/cegma_v2.4.010312.tar.gz&lt;br /&gt;
&lt;br /&gt;
The binaries are as follows:&lt;br /&gt;
# cegma&lt;br /&gt;
# completeness&lt;br /&gt;
# geneid-train&lt;br /&gt;
# genome_map&lt;br /&gt;
# hmm_select&lt;br /&gt;
# local_map&lt;br /&gt;
# make_paramfile&lt;br /&gt;
# parsewise&lt;/div&gt;</summary>
		<author><name>Rf</name></author>	</entry>

	</feed>