PHP Classes

Class: Bot recognizer and dispatcher

Name: Bot recognizer and dispatcher
Base name: bot_recognizer
Description: Recognize and handle Web robots by IP address or user agent
Version: 1.0
Required PHP version: 5.0
License: BSD License

Author

Name: Alexander Selifonov
Published packages: 9
Country: Russian Federation
Home page: http://www.selifan.ru/

Innovation Award

PHP Programming Innovation award nominee
October 2009
Number 2
If you want to detect whether a site is being accessed by a person, a search engine robot crawling the site, or some kind of malicious program, you can check the user agent header.

However, that header can be forged, so a malicious program can set a misleading value and pretend to be a search engine robot.

This class provides a more reliable solution: it checks the IP address of the computer accessing the site against a list of IP address ranges known to be used by search engine robots.

Manuel Lemos

Groups

HTTP: HTTP protocol clients, headers and cookies
PHP 5: Classes using PHP 5 specific features
Security: Security protection and attack detection

Detailed description

This class can be used to recognize and handle Web robots by IP address or user agent.

It can check the IP address of the computer currently accessing the Web server, or the user agent string sent by its browser program, against a list of IP address ranges and user agents known to belong to Web robots, such as search engine crawlers or malicious crawlers.
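As an illustration of the IP range check described above, here is a minimal sketch. It is not the class's own code: the function names and the range table are hypothetical, and the addresses come from the documentation-only blocks reserved by RFC 5737 rather than from real crawler ranges. Only IPv4 is handled.

```php
<?php
// Hypothetical range table: [first IP, last IP, robot name].
// Addresses are RFC 5737 documentation ranges, not real crawler ranges.
$botRanges = array(
    array('192.0.2.0', '192.0.2.255', 'example-search-bot'),
    array('198.51.100.10', '198.51.100.20', 'example-bad-bot'),
);

/**
 * Convert a dotted IPv4 string to an unsigned number, or null if invalid.
 * sprintf('%u') avoids negative values on 32-bit PHP builds.
 */
function ipv4_to_number($ip)
{
    $long = ip2long($ip);
    if ($long === false) {
        return null;
    }
    return (float) sprintf('%u', $long);
}

/**
 * Return the name of the first range that contains $ip, or null if none does.
 */
function find_bot_by_ip($ip, array $ranges)
{
    $addr = ipv4_to_number($ip);
    if ($addr === null) {
        return null; // not a valid IPv4 address
    }
    foreach ($ranges as $range) {
        list($first, $last, $name) = $range;
        $lo = ipv4_to_number($first);
        $hi = ipv4_to_number($last);
        if ($lo === null || $hi === null) {
            continue; // skip malformed table entries
        }
        if ($addr >= $lo && $addr <= $hi) {
            return $name;
        }
    }
    return null;
}
```

In a real request the address would typically come from `$_SERVER['REMOTE_ADDR']`, for example `find_bot_by_ip($_SERVER['REMOTE_ADDR'], $botRanges)`.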

The class can call different callback functions depending on the type of crawler that was identified.
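The dispatching idea can be sketched as a lookup from crawler type to callback. The type labels (`search`, `malicious`), the function name `dispatch_bot`, and the return values are assumptions made for this example and are not taken from the class; the closures also require PHP 5.3 or later, which is newer than the package's stated minimum.

```php
<?php
/**
 * Call the handler registered for $type and return its result.
 * If no callable handler is registered, return $default instead.
 */
function dispatch_bot($type, array $handlers, $default = null)
{
    if (isset($handlers[$type]) && is_callable($handlers[$type])) {
        return call_user_func($handlers[$type], $type);
    }
    return $default;
}

// Hypothetical handlers, keyed by an assumed crawler type label.
$handlers = array(
    'search' => function ($type) {
        return 'serve lightweight page';
    },
    'malicious' => function ($type) {
        return 'deny request';
    },
);
```

Keeping the handlers in an array means a new crawler type only needs a new entry, and unrecognized visitors fall through to the default.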

It can also be put into debug mode, in which it takes a given IP address or user agent string instead of the values sent by the accessing browser.
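One way to picture the debug mode is a small function that prefers explicitly supplied test values over the request data. This is a sketch under assumptions: `resolve_client` and its parameters are hypothetical names, not the class's interface.

```php
<?php
/**
 * Pick the IP address and user agent to test.
 * Debug values, when given, replace the ones from the request ($server
 * would normally be $_SERVER).
 */
function resolve_client(array $server, $debugIp = null, $debugAgent = null)
{
    $ip = isset($server['REMOTE_ADDR']) ? $server['REMOTE_ADDR'] : '';
    $agent = isset($server['HTTP_USER_AGENT']) ? $server['HTTP_USER_AGENT'] : '';

    if ($debugIp !== null) {
        $ip = $debugIp; // debug override for the client address
    }
    if ($debugAgent !== null) {
        $agent = $debugAgent; // debug override for the user agent
    }
    return array('ip' => $ip, 'agent' => $agent);
}
```

Passing the request array in as a parameter, rather than reading `$_SERVER` inside the function, makes it possible to try out a rule from the command line without a live crawler visit.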

The Web robot information is stored in a database. The class can load that database from a text data file. The database can be accessed using the supplied database class or a database adapter class from the Zend Framework.
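To give a rough idea of what loading definitions from a text file involves, the sketch below parses a made-up line format. The layout `name|type|first_ip|last_ip`, the `#` comment marker, and the function name `parse_bot_defs` are all assumptions for illustration; the actual format of bot-defs.txt is described in the package documentation and may differ.

```php
<?php
/**
 * Parse definition text in an ASSUMED format, one robot per line:
 *   name|type|first_ip|last_ip
 * Blank lines, lines starting with '#', and malformed lines are skipped.
 */
function parse_bot_defs($text)
{
    $defs = array();
    foreach (preg_split('/\r\n|\r|\n/', $text) as $line) {
        $line = trim($line);
        if ($line === '' || $line[0] === '#') {
            continue; // blank line or comment
        }
        $fields = array_map('trim', explode('|', $line));
        if (count($fields) !== 4) {
            continue; // wrong number of fields
        }
        list($name, $type, $first, $last) = $fields;
        $defs[] = array(
            'name' => $name,
            'type' => $type,
            'first_ip' => $first,
            'last_ip' => $last,
        );
    }
    return $defs;
}
```

The resulting rows could then be inserted into a database table, for example by reading the file with `file_get_contents()` and looping over the parsed array.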

Freshmeat project

Project record: bot_recognizer

Applications that use this class

No application links were specified for this class.

Files

File                   Type        Role     Description
bot_recognizer.php     Plain text  Class    Main class module
as_dbutils.php         Plain text  Aux.     Database access wrapper, used in main module
bot-defs.txt           Plain text  Data     Initial bot definitions file
import-botdefs.php     Plain text  Aux.     Script for creating the initial definition table in an SQL database and importing data from a text source
using-sample.php       Plain text  Example  Usage example script
bot_recognizer.en.htm  HTML        Doc.     Documentation (English)
bot_recognizer.ru.htm  HTML        Doc.     Documentation (Russian)
Download all files: bot_recognizer.tar.gz bot_recognizer.zip