Sophie

Sophie

distrib > Mandriva > 2010.0 > i586 > media > contrib-release > by-pkgid > 5e1854624d3bc613bdd0dd13d1ef9ac7 > files > 1392

gap-system-4.4.12-5mdv2010.0.i586.rpm

<!-- #################################################################### -->
<!-- ##                                                                ## -->
<!-- ##  timings.xml        FactInt documentation         Stefan Kohl  ## -->
<!-- ##                                                                ## -->
<!-- ##  $Id: timings.xml,v 1.3 2007/09/19 09:35:21 stefan Exp $       ## -->
<!-- ##                                                                ## -->
<!-- #################################################################### -->

<Chapter Label="ch:Timings">
<Heading>How much Time does a Factorization take?</Heading>

<!-- #################################################################### -->

<Section Label="sec:TimingsForTheGeneralFactorizationRoutine">
<Heading>Timings for the general factorization routine</Heading>

A few words in advance: In general, it is not possible to give a precise
prediction for the CPU time needed for factoring a given integer.
This time depends heavily on the sizes of the factors of the given number
and on some other properties which cannot be tested before actually
doing the factorization.
But nevertheless, rough runtime estimates can be given for numbers
with factors of given orders of magnitude. <P/>

After casting out the small and other <Q>easy</Q> factors -- which should
not take more than at most a few minutes for numbers of
<Q>reasonable</Q> size -- the general factorization routine uses first ECM
(see&nbsp;<Ref Func="FactorsECM"
               Label="Elliptic Curves Method, ECM"/>)
for finding factors very roughly up to
the third root of the remaining composite and then the MPQS
(see&nbsp;<Ref Func="FactorsMPQS"
               Label="Multiple Polynomial Quadratic Sieve, MPQS"/>)
for doing the <Q>rest</Q> of the work.
The latter is often the most time-consuming part. <P/>

In the sequel, some timings for the ECM and for the MPQS are given.
These methods are by far the most important ones with respect to
runtime statistics (the <M>p \pm 1</M>-methods
(see&nbsp;<Ref Func="FactorsPminus1" Label="Pollard's p-1"/> and
          <Ref Func="FactorsPplus1"  Label="Williams' p+1"/>)
are only suitable for finding factors with certain properties and CFRAC
(see&nbsp;<Ref Func="FactorsCFRAC"
               Label="Continued Fraction Algorithm, CFRAC"/>)
is just a slower predecessor of the MPQS).
All absolute timings are given for a Pentium&nbsp;200 under Windows
as a reference machine (this was a fast machine at the time the first
version of this package has been written).

</Section>

<!-- #################################################################### -->

<Section Label="sec:TimingsForTheECM">
<Heading>Timings for the ECM</Heading>

The runtime of <C>FactorsECM</C> depends mainly on the size of the factors
of the input number.
On average, finding a 12-digit factor of a 100-digit number
takes about 1&nbsp;min&nbsp;40&nbsp;s, finding a 15-digit factor of
a 100-digit number takes about 10&nbsp;min and finding an 18-digit factor
of a 100-digit number takes about 50&nbsp;min.
A&nbsp;general rule of thumb is the following: one digit more increases the
runtime by a bit less than a factor of two.
These timings are very rough, and they may vary by a factor of 10 or more.
You can compare trying an elliptic curve with throwing a couple of dice,
where a success corresponds to the case where all of them show the same
side -- it is possible to be successful with the first trial, but it is
also possible that this takes much longer. In particular, all trials are
independent of one another.
In general, ECM is superior to Pollard's Rho for finding factors with at
least 10 decimal digits. In the same time needed by Pollard's Rho
for finding a 13-digit factor one can reasonably expect to find a
17-digit factor when using ECM, for which Pollard's Rho in turn would
need  about 100 times as long as&nbsp;ECM.
For larger factors this difference grows rapidly.
From theory it can be said that finding a 20-digit factor requires
about 500 times as much work as finding a 10-digit factor,
finding a 30-digit factor requires about 160 times as much work as
finding a 20-digit factor and finding a 40-digit factor requires
about 80 times as much work as finding a 30-digit factor. <P/>

The default parameters are optimized for finding factors with about
15 -- 35 digits. This seems to be a sensible choice, since this is the
most important range for the application of ECM. The function
<C>FactorsECM</C> usually gives up when the input number&nbsp;<M>n</M>
has two factors which are both larger than its third root.
This is of course only a <Q>probabilistic</Q> statement. Sometimes --
but seldom -- the remaining composite has 3&nbsp;factors, 4&nbsp;factors
should occur (almost) never. <P/>

The user can of course specify other parameters than the default ones,
but giving timings for all possible choices is obviously impossible.
The interested reader should follow the references given in the
bibliography at the end of this manual for getting information on how
many curves with which parameters are usually needed for finding
factors of a given size. This depends mainly on the distribution of
primes, respectively of numbers with prime factors not exceeding a
certain bound. <P/>

For benchmarking purposes, the amount of time needed for trying
a single curve with given smoothness bounds for a number of given size
is suited best.
A typical example is the following: one curve with
(<A>Limit1</A>,<A>Limit2</A>) = (100000,10000000) applied to a 100-digit
integer requires a total of 10&nbsp;min&nbsp;20&nbsp;s, where
6&nbsp;min&nbsp;45&nbsp;s are spent for the first stage and
3&nbsp;min&nbsp;35&nbsp;s are spent for the second stage. The time needed
for the first stage is approximately linear in <A>Limit1</A> and the time
needed for the second stage is a bit less than linear in <A>Limit2</A>.

</Section>

<!-- #################################################################### -->

<Section Label="sec:TimingsForTheMPQS">
<Heading>Timings for the MPQS</Heading>

The runtime of <C>FactorsMPQS</C> depends only on the size of the input
number, and not on the size of its factors.
Rough timings are as follows: 90&nbsp;s for a 40-digit number, 10&nbsp;min
for a 50-digit number, 2&nbsp;h for a 60-digit number, 20&nbsp;h for
a 70-digit number and 100&nbsp;h for a 75-digit number.
These timings are much more precise than those given for ECM, but they
may also vary by a factor of 2 or&nbsp;3 depending on whether a good factor
base can be found without using a large multiplier or not.
A general rule of thumb is the following: 10&nbsp;digits more cause
10&nbsp;times as much work. For benchmarking purposes, precise timings
for some integers are given: <M>38!+1</M> (45&nbsp;digits, good factor base
with multiplier&nbsp;1): 2&nbsp;min&nbsp;22&nbsp;s, <M>40!-1</M>
(48&nbsp;digits, not so good factor base even with multiplier&nbsp;7):
8&nbsp;min&nbsp;58&nbsp;s, cofactor of <M>1093^{33}+1</M> (61&nbsp;digits,
good factor base with multiplier&nbsp;1): 1&nbsp;h&nbsp;12&nbsp;min.

</Section>

<!-- #################################################################### -->

</Chapter>

<!-- #################################################################### -->