Sphider-plus version 4.2021b - The PHP Search Engine

All required information.

[ Change Log Summary ]


- Current release:    4.2021b

- Former versions:

          Version 4.2021a

          Version 3.2020d       Version 3.2020c

          Version 3.2020b       Version 3.2020a

          Version 3.2019c

          Version 3.2019b       Version 3.2019a

          Version 3.2018b       Version 3.2018a

          Version 3.2017b        Version 3.2017a


          Version 3.2016d        Version 3.2016c

          Version 3.2016b        Version 3.2016a


          Version 3.2015e        Version 3.2014c

          Version 3.2015d        Version 3.2014b

          Version 3.2015c        Version 3.2014a

          Version 3.2015b        Version 3.2013b

          Version 3.2015a        Version 3.2013a



- Older versions:


          Version 2.9          Version 1.9

          Version 2.8          Version 1.8

          Version 2.7          Version 1.7

          Version 2.6          Version 1.6

          Version 2.5          Version 1.5

          Version 2.4          Version 1.4

          Version 2.3          Version 1.3

          Version 2.2          Version 1.2

          Version 2.1          Version 1.1

          Version 2.0          Version 1.0


Version v.2.3

Release date: April 23, 2010

Built on Sphider: v.1.3.5

To ease the integration of Sphider-plus into existing sites, HTML templates are provided for:

- Search form

- Text results

- Media results

- Most popular queries

- etc.

New feature:

Allow indexing of other hosts with the same domain name for links found during indexing. TLD, SLD and www are also ignored.

More details in documentation chapter: Allow other hosts in same domain

New feature:

Allow indexing of other hosts with the same domain name, but only if the found links are redirected. TLD, SLD and www are also ignored.

More details in documentation chapter: Allow other hosts in same domain

New feature:

Index sites and follow links containing non-‘Basic Latin’ and non-ASCII characters as part of their URL.

2 new features for sorting the result listing:
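
One common way to make such URLs fetchable is to convert them to an ASCII-only form before requesting them. The following is a minimal Python sketch of that idea, for illustration only: Sphider-plus itself is written in PHP, and the function name to_ascii_url is hypothetical. It assumes UTF-8 percent-encoding for path and query and the IDNA form for host names; user info and fragments are dropped for simplicity.

```python
from urllib.parse import urlsplit, urlunsplit, quote


def to_ascii_url(url: str) -> str:
    """Return an ASCII-only form of a URL that may contain non-ASCII characters."""
    parts = urlsplit(url)
    # Host names are converted to their IDNA ("xn--") form.
    host = parts.hostname.encode("idna").decode("ascii") if parts.hostname else ""
    if parts.port is not None:
        host = f"{host}:{parts.port}"
    # Path and query are UTF-8 percent-encoded. '%' is kept as a safe
    # character so that already-encoded sequences are not encoded twice.
    path = quote(parts.path, safe="/%")
    query = quote(parts.query, safe="=&%")
    # The fragment is dropped: it is never sent to the server.
    return urlunsplit((parts.scheme, host, path, query, ""))
```

A URL that is already pure ASCII passes through unchanged, so the function can be applied to every link found during indexing.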

- Results of a promoted / featured domain will be displayed on top of the search result listing.

   As part of the Admin settings, a domain name or part of the name can be entered.

   All search results belonging to this domain will be placed on top of the result listing.

- Pages containing a catchword will be displayed on top of the search result listing.

   As part of the Admin settings, the catchword can be entered.

More details in documentation chapter: Chronological order for result listing
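
The two sorting features can be pictured as a stable re-ordering of an already ranked result list. This is a minimal Python sketch for illustration only, not the Sphider-plus implementation: the function name promote, the dictionary keys "url" and "text", and the choice to rank the promoted domain ahead of the catchword are all assumptions.

```python
from urllib.parse import urlsplit


def promote(results, promoted_domain="", catchword=""):
    """Move promoted-domain and catchword results to the top of the list."""

    def rank(result):
        host = (urlsplit(result["url"]).hostname or "").lower()
        # A full domain name or only a part of it may be configured.
        if promoted_domain and promoted_domain.lower() in host:
            return 0
        if catchword and catchword.lower() in result["text"].lower():
            return 1
        return 2

    # sorted() is stable, so the original order is kept within each group.
    return sorted(results, key=rank)
```

Leaving both settings empty returns the list in its original order, which corresponds to the features being switched off.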

New feature:

Split words into their basic parts, separated at each hyphen, dot or comma inside the words.

For example 'sphider-plus.eu' will be divided into the 3 keywords: sphider plus eu

Because the original word is also stored as a keyword, all 4 words become searchable.

Alternatively, separation at hyphens only can be selected in Admin settings.
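
The splitting rule described above can be sketched as follows. This is an illustrative Python sketch under stated assumptions, not the Sphider-plus source (which is PHP): the function name split_keyword and the hyphens_only parameter are hypothetical stand-ins for the Admin setting.

```python
import re


def split_keyword(word, hyphens_only=False):
    """Return the original word followed by its basic parts."""
    # Split at hyphen, dot or comma, or at hyphens only if so configured.
    pattern = r"-" if hyphens_only else r"[-.,]"
    parts = [part for part in re.split(pattern, word) if part]
    if len(parts) < 2:
        # Nothing to split: only the original word is stored.
        return [word]
    # The original word is kept, so it stays searchable as well.
    return [word] + parts
```

Calling split_keyword('sphider-plus.eu') yields the original word plus sphider, plus and eu, that is, the 4 searchable keywords mentioned above.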

New feature:

Index the "Description" Meta tag in HTML header. To be activated in Admin settings.

New feature:

Indexing of media files enabled for servers that do not offer all PHP functions for remote files.

Bypassed PHP functions are: fopen(); file_get_contents(); md5_file();

3 new features for command line operation:

- Erase & Re-index all sites ( -eall )

- Index all new URLs in the database which have not yet been indexed ( -new )

- Re-index all sites that have been erased in the meantime ( -erased )

New feature:

In order to index XLS files, a converter for Excel files was developed. Implemented as a PHP script,

the converter needs no adaptation to the operating system.

New Admin setting:

Index RAR compressed files and archives.

Supports (X)HTML, XML and also compressed PDFs and other document files, as well as all kinds of feeds,

frames and iframes. Links found in the compressed files will be followed.

15 language specific stemming algorithms implemented. Individually selectable for:

Bulgarian, Chinese, Czech, Dutch, English, Finnish, French, German,

Greek, Hungarian, Italian, Portuguese, Russian, Spanish and Swedish.

More details in documentation chapter: Word stemming

New Admin setting:

Activate/disable: Create 'sitemap.xml' file of each indexed site.
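
For orientation, a 'sitemap.xml' file following the public sitemaps.org protocol can be produced from a list of indexed URLs roughly like this. The Python sketch below is an illustration only; the function name build_sitemap is hypothetical, and the exact elements written by Sphider-plus are not stated here, so only the mandatory loc element is emitted.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(urls):
    """Return the text of a sitemap.xml document listing the given URLs."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for url in urls:
        entry = ET.SubElement(urlset, "url")
        # ElementTree escapes characters such as '&' when serializing.
        ET.SubElement(entry, "loc").text = url
    body = ET.tostring(urlset, encoding="unicode")
    return '<?xml version="1.0" encoding="UTF-8"?>\n' + body
```

The returned string would then be written to a 'sitemap.xml' file, one per indexed site.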

New Site option in Admin menu:

Erase/clean site-specific data from MySQL database and thumbnails folder for a selected site.

New Admin setting:

Re-index all sites that have been erased in the meantime.

New Admin setting:

Show complete list during import and export of URLs, or hide output.

24 language-specific common files, each holding a list of words to be ignored during indexing (stop words).

Added or updated for:

Arabic, Bengali, Bulgarian, Catalan, Czech, Danish, Dutch, English,

Farsi, Finnish, French, Greek, German, Hindi, Hungarian, Italian, Norwegian,

Polish, Portuguese, Romanian, Russian, Spanish, Swedish and Turkish.

To speed up the index procedure, they must be activated individually in Admin settings.
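
How a stop word list is applied during indexing can be sketched as follows. This is a minimal Python illustration, not the Sphider-plus code: the function names are hypothetical, and the file layout of one word per line is an assumption about the common files.

```python
def load_stop_words(lines):
    """Build a stop word set from an iterable of lines (assumed: one word per line)."""
    return {line.strip().lower() for line in lines if line.strip()}


def remove_stop_words(words, stop_words):
    """Drop every word that appears in the stop word set, ignoring case."""
    return [word for word in words if word.lower() not in stop_words]
```

Only the lists for activated languages would be loaded, which keeps the set small and the lookup per word cheap.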

New feature:

Self test for all required PHP libraries and extensions. If Debug mode is enabled,

the corresponding warning messages will be presented on top of the Settings menu.

Improved database 'Activate / Disable' menu:

If multiple sets of tables are available because they were created for a database earlier,
you can activate any of these table sets by selecting the corresponding prefix.

New Admin setting:

Define directory for templates (relative to root directory of Sphider-plus)

Searching and opening media files is now enabled for media links of up to 1024 characters.

Input settings for database configuration menus are now enabled for values up to 255 characters.

'Clean resources' improved for index procedure.

In case of failure, only warning messages are issued and indexing is not aborted.

The 'Clean resources' feature is now also available for the search procedure.

Common activation in Admin settings for search and index procedure.

If Debug mode is enabled, new keywords are now presented in alphabetical order during the index procedure.

Following ‘robots.txt’ directives is now also enabled for localhost applications.

Fixed a bug that caused the result listing to be presented only in lower-case characters.

Results are now presented as in the original title and full text of the indexed pages.
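
What honoring 'robots.txt' means for a crawler on localhost can be illustrated with Python's standard robot parser. This is a sketch for illustration only, not the Sphider-plus implementation; the helper name build_rules and the user agent string "examplebot" used in the usage note are hypothetical.

```python
from urllib.robotparser import RobotFileParser


def build_rules(robots_txt: str) -> RobotFileParser:
    """Parse the text of a robots.txt file into a rule set that can be queried."""
    parser = RobotFileParser()
    # parse() takes the file content as a list of lines.
    parser.parse(robots_txt.splitlines())
    return parser
```

With the rules "User-agent: *" and "Disallow: /private/", calling can_fetch("examplebot", "http://localhost/private/page.html") on the returned parser reports that the page must not be fetched, while pages outside /private/ remain allowed.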

Some more small bugs eliminated.

Updated Admin dialog. Thanks to Ian Bucklar.

Additional language file for Hebrew. Thanks to Noam Bercovitz.

Updated Russian language file. Thanks to Uttkirbek Abdullaev.

Updated Romanian language file. Thanks to Lionel Geo Mischie.

Files that have been modified / added for this release:

.../converter/xls2csv.exe (no longer required)

.../include/common/all common_xyz.txt files

.../include/stemming/all files

.../templates/all folder/thisstyle.css

.../templates/html/all files