Sophie

Sophie

distrib > Mandriva > 2010.0 > i586 > media > contrib-release > by-pkgid > ed4950ee216151219bf3841700ccc7f8 > files > 40

gp-0.26-6mdv2010.0.i586.rpm








<html><head><title>gp_mkmtx</title>

<link rev="made" href="mailto:january@bioinformatics.org">
</head>
<body bgcolor="#FFFFFF" link="#FF0000">

<hr>

<h1>gp_mkmtx</h1>
<h2>GP</h2>
<h2>2000</h2>


    
<h2>NAME</h2>
    gp_mkmtx - calculate frequencies of nucleotides
<p><h2>SYNOPSIS</h2>
    
    <strong>gp_mkmtx</strong> [-a] [-g value] [-l] [-q] [-v] [-d] [-h] [inputfile] [outputfile]
<p><h2>OPTIONS</h2>
    
<p><dl>
<p><p></p><dt><strong>-a</strong><dd> print only the absolute numbers of occurencies
<p><p></p><dt><strong>-g value</strong><dd> divide each frequency by the expected frequency at GC contents
		equal to <strong>value</strong> %.
<p><p></p><dt><strong>-l</strong><dd> do <strong>not</strong> apply logarythmic scaling (as a default, <strong>gp_mkmtx</strong>
		calculates the logarythm of the frequencies.
<p><p></p><dt><strong>-v</strong><dd> Prints the version information.
<p><p></p><dt><strong>-d</strong><dd> Prints lots of debugging information.
<p><p></p><dt><strong>-h</strong><dd> Shows usage information.
<p><p></p><dt><strong>inputfile</strong><dd> file to proces; if not given, will use standard input
<p><p></p><dt><strong>outputfile</strong><dd> file to write the data to; if not given, will
    use standard output
<p></dl>
<p><h2>DESCRIPTION</h2>
    
<p><strong>gp_mkmtx</strong> is supposed to be a tool for an easy creation of matrices for the
	<a href="gp_matrix.html"><strong>gp_matrix</strong></a> program. It takes a set of sequences,
	calculates the frequency of a nucleotide at each position starting from the
	first nucleotide and ending with the last nucleotide of the shortest sequence.
	For each position, four values are printed in a row, respectively for A, C, G
	and T/U. Each value is the logarithm of the calculated frequency
	(logarythmisation can be suppresed with the <strong>-l</strong> option). If the <strong>-g</strong>
	option is used, prior to the logarithmic scaling the values are diveded by the
	expected frequency at the given GC contents (that is, for example, at GC=50%,
	0.25 for each nucleotide). 
<p><h2>EXAMPLES</h2>
    
<p>gp_mkmtx -g 50 somesequence.fasta somesequence.mtx
<p>will produce a matrix file somesequence.mtx which, after some editing, will be
	directly suitable for the <a href="gp_matrix.html"><strong>gp_matrix</strong></a> program.
<p><h2>SEE ALSO</h2>
    
<a href="index.html">Genpak(1)</a> 
<a href="gp_acc.html">gp_acc(1)</a> 
<a href="gp_adjust.html">gp_adjust(1)</a> 
<a href="gp_cdndev.html">gp_cdndev(1)</a> 
<a href="gp_cusage.html">gp_cusage(1)</a> 
<a href="gp_digest.html">gp_digest(1)</a> 
<a href="gp_dimer.html">gp_dimer(1)</a> 
<a href="gp_findorf.html">gp_findorf(1)</a> 
<a href="gp_gc.html">gp_gc(1)</a> 
<a href="gp_getseq.html">gp_getseq(1)</a> 
<a href="gp_map.html">gp_map(1)</a> 
<a href="gp_matrix.html">gp_matrix(1)</a> 
<a href="gp_pattern.html">gp_pattern(1)</a> 
<a href="gp_primer.html">gp_primer(1)</a> 
<a href="gp_qs.html">gp_qs(1)</a> 
<a href="gp_randseq.html">gp_randseq(1)</a> 
<a href="gp_seq2prot.html">gp_seq2prot(1)</a> 
<a href="gp_slen.html">gp_slen(1)</a> 
<a href="gp_tm.html">gp_tm(1)</a> 
<a href="gp_trimer.html">gp_trimer(1)</a> 
<p><h2>DIAGNOSTICS</h2>
    
<p>All <strong>Genpak</strong> programs complain in situations you would also complain,
like when they cannot find a sequence you gave them or the sequence is not
valid. 
<p>The <strong>Genpak</strong> programs do not write over existing files. I have found this
feature very useful :-)
<p><h2>BUGS</h2>
    
<p>I'm sure there are plenty left, so please mail me if you find them. I tried
to clean up every bug I could find.
<p><h2>AUTHOR</h2>
    
<p>January Weiner III
		<a href="mailto:january@bioinformatics.org">&lt;january@bioinformatics.org&gt;</a>    
</body>
</html>