<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd"> <!-- ***** GENERATED FILE *** DO NOT EDIT DIRECTLY - any changes will be LOST ****** swish-e.org mockup based on http://www.oswd.org/design/1773/prosimii/index2.html --> <html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en" lang="en-US"> <head> <meta http-equiv="content-type" content="text/html; charset=iso-8859-1" /> <link rel="stylesheet" type="text/css" href="./swish.css" media="screen" title="swish css" /> <link rel="shortcut icon" href="/favicon.ico" type="image/x-icon" /> <link rel="Last" href="./filter.html" /> <link rel="Prev" href="./swish-faq.html" /> <link rel="Up" href="./index.html" /> <link rel="Next" href="./swish-3.0.html" /> <link rel="Start" href="./index.html" /> <link rel="First" href="./readme.html" /> <title>Swish-e :: SWISH-BUGS - List of bugs known in Swish-e</title> </head> <body> <!-- noindex --> <!-- For non-visual user agents: --> <div id="top"><a href="#main-copy" class="doNotDisplay doNotPrint">Skip to main content.</a></div> <!-- ##### Header ##### --> <div id="header"> <div class="superHeader"> <span>Related Sites:</span> <a href="http://swishewiki.org/" title="swishe wiki">swish-e wiki</a> | <a href="http://www.xmlsoft.org/" title="libxml2 home page">libxml2</a> | <a href="http://www.zlib.net/" title="zlib home page">zlib</a> | <a href="http://www.foolabs.com/xpdf/" title="xpdf home page">xpdf</a> | <a href="http://dev.swish-e.org/browser" title="browse source code">Subversion</a> </div> <div class="midHeader"> <h1 class="headerTitle" lang="la">Swish-e</h1> <div class="headerSubTitle">Simple Web Indexing System for Humans - Enhanced</div> <br class="doNotDisplay doNotPrint" /> <div class="headerLinks"> <span class="doNotDisplay">Tools:</span> <!-- don't know what platform, so link to download page --> <a href="http://swish-e.org/download/index.html">download latest version</a> </div> </div> </div> <!-- index --> <!-- 
noindex --> <div class="subHeader"> <table width='100%'> <tr> <td align='left'> <a href="http://swish-e.org/index.html">home</a> | <a href="http://swish-e.org/support.html">support</a> | <a href="http://swish-e.org/download/index.html">download</a> </td> <td align='right'> <form method="get" action="http://swish-e.org/search/index.html" enctype="application/x-www-form-urlencoded" class="srchform"> <label for="searchfield">Search for</label> <input maxlength="200" value="" id="searchfield" size="30" name="query" type="text" alt="Search input field" /> <input value="search swish-e.org" name="submit" type="submit" class='button' /> </form> </td> </tr> </table> </div> <!-- index --> <div id="body-area" class="clearfix"> <div id="content-area"> <div id="main-copy"> <h1>SWISH-BUGS - List of bugs known in Swish-e</h1> Swish-e version 2.4.7 <!-- noindex --> <h2>Table of Contents</h2> <div class="toc"> <ul class="toc"> <li> <a href="#description">DESCRIPTION</a> </li> <li> <a href="#bugs_in_swish_e_version_2_4">Bugs in Swish-e version 2.4</a> </li> <li> <a href="#document_info">Document Info</a> </li> </ul> </div> <!-- index --> <hr /> <div class="sub-section"> <h1><a name="description"></a>DESCRIPTION</h1> <p>This file contains a list of bugs reported or known in Swish-e. If you find a bug listed here you do not need to report it as a bug. But feel free to bug the developers about it on the Swish-e discussion list.</p> <p>Note that this list is incomplete and may not be up to date.</p> </div> <div class="sub-section"> <h1><a name="bugs_in_swish_e_version_2_4"></a>Bugs in Swish-e version 2.4</h1> <ul> <li><a name="item_stopwords"></a><a name="stopwords"></a><b>Stopwords not removed from query with Soundex</b> <p>In dev version 2.5.2 it was noticed that stopwords are not removed from the query when using Soundex. The plan is to rewrite the parser soon... 
(July 2004)</p> </li> <li><a name="item_wild"></a><a name="wild"></a><b>Wild card searching can be very slow</b> <p>Wild card searching needs to be optimized.</p> <p>Here's a three letter search:</p> <pre class="pre-section"> $ swish-e -w 'tra*' -m1
 # Number of hits: 99952
 # Search time: 5.424 seconds</pre> <p>Two letters:</p> <pre class="pre-section"> $ swish-e -w 'tr*' -m1
 # Number of hits: 100000
 # Search time: 10.563 seconds</pre> <p>Single letter search:</p> <pre class="pre-section"> $ swish-e -w 't*' -m1
 # Number of hits: 100000
 # Search time: 510.939 seconds</pre> <p>The single letter search also used about 280MB of RAM.</p> <p>This creates the potential for a denial-of-service (DoS) attack. If you have a large index you may wish to filter out single character wild cards before passing the query to swish-e.</p> </li> <li><a name="item_character"></a><a name="character"></a><b>Character Encodings</b> <p>The XML parser (Expat) returns UTF-8 data to swish-e. Therefore, the XML parser should only be used for parsing US-ASCII encoded text.</p> <p>The XML2 &amp; HTML2 parsers (Libxml2) convert characters from UTF-8 to ISO-8859-1 before indexing and writing properties. Indexing non-8859-1 data may result in invalid character mappings.</p> <p>These issues will be resolved soon.</p> </li> <li><b>Phrase search fails with DoubleMetaphone</b> <p>DoubleMetaphone searching can produce two search words for a single query word. The words are expanded to (word1 OR word2), but that fails in a phrase query: "some phrase (word1 or word2) here"</p> <p>The swish-e query parser is due for a rewrite, and this could be resolved then.</p> <pre class="pre-section"> Reported: August 20, 2002 - moseley</pre> </li> <li><b>Merging</b> <p>merge.c does not check for matching stopwords or buzzwords in each index.</p> <p>History:</p> <pre class="pre-section"> Reported: September 3, 2002 - moseley</pre> </li> <li><b>ResultSortOrder</b> <p>ResultSortOrder is not used (and is not documented). 
The problem is that the data passed to Compare_Properties() does not have access to the ResultSortOrder table.</p> <p>History:</p> <pre class="pre-section"> Reported: September 3, 2002 - moseley</pre> </li> </ul> </div> <div class="sub-section"> <h1><a name="document_info"></a>Document Info</h1> <p>$Id: SWISH-BUGS.pod 1613 2005-02-02 22:53:39Z whmoseley $</p> </div> </div><!-- /#main-copy --> </div><!-- /#content-area --> <div id="side-bar"> <!-- noindex --> <ul class="menu"><li class="menuparent"> <a class="menu" href="./index.html" >Doc Overview</a> <!-- noindex --> <ul class="submenu"><li class=""> <a class="submenu" href="./readme.html" title="First time users">README</a> </li><li class=""> <a class="submenu" href="./install.html" title="Installation and usage overview">Install</a> </li><li class=""> <a class="submenu" href="./changes.html" title="Important changes from previous versions">Changes</a> </li><li class=""> <a class="submenu" href="./swish-config.html" title="Directives that go in your Swish-e configuration file">Configuration</a> </li><li class=""> <a class="submenu" href="./swish-run.html" title="Command line options for Swish-e binary">Running</a> </li><li class=""> <a class="submenu" href="./swish-search.html" title="Swish-e's search language">Searching</a> </li><li class=""> <a class="submenu" href="./swish-faq.html" >FAQ</a> </li><li class=""> <a class="thisfile" href="./swish-bugs.html" >Known issues »</a> </li><li class=""> <a class="submenu" href="./swish-3.0.html" >The Future</a> </li><li class=""> <a class="submenu" href="./swish-library.html" title="Swish-e C API">C API</a> </li><li class=""> <a class="submenu" href="./api.html" title="Perl interface to the Swish-e library">Perl API</a> </li><li class=""> <a class="submenu" href="./swish.cgi.html" title="Example CGI/mod_perl script">Swish.cgi</a> </li><li class=""> <a class="submenu" href="./search.cgi.html" title="Example Perl script using SWISH::API">Search.cgi</a> </li><li 
class=""> <a class="submenu" href="./spider.html" title="The Swish-e HTTP spider">Spider.pl</a> </li><li class=""> <a class="submenu" href="./filter.html" title="How to index non-text documents">Filters</a> </li></ul> <!-- index --> </li></ul> <!-- index --> </div><!-- /#side-bar --> </div><!-- /#body-area --> <div id="footer"> <!-- noindex --> <span class="doNotPrint"> Swish-e is distributed with <strong>no warranty</strong> under the terms of the <br /> <a href='http://swish-e.org/license.html'>Swish-e License</a>.<br /> Questions may be posted to the <a href="http://swish-e.org/discuss.html" title="email list and list archive">Swish-e Discussion list</a>. </span> <p> <strong>URI »</strong> http://swish-e.org/ • <strong>Generated »</strong>Sun, 05 Apr 2009 02:02:28 UTC</p> <!-- index --> </div><!-- /#footer --> </body> </html>