WISE Classifier


Background

 

The number of digital documents available in corporate intranets, digital libraries, commercial data bases or the Web as a whole is vastly growing in size. The sheer volume makes it often prohibitively expensive, if not impossible, to rely on librarians or other domain experts to efficiently annotate and categorize content. Consequently, automated document categorization is an important component in many information systems.

Overview

WISE Classifier is the best tool to automatically classify large scale documents into appropriate categories and
gives the user higher level logical view. You can get a quite effective and convenient experience by using WISE
Classifier, for example, separate taxonomies can be created for the sales, marketing, human resources, and
engineering departments, and documents can be classified automatically to the hierarchy of these taxonomies.
This structure information adds a valuable dimension to the content discovery process.


Special Features

 ■ Accurate classification

    - Provide state-of-the-art supervised machine learning technology

    - Rule customization helps the accuracy improvement

 ■ Flexibility and stability

    - Can be connected with existing system

    - Can form basis for more advanced systems

 ■ Effective Management

    - Web based user friendly UI

    - Convenient configuration management

    - Convenient I/O process


Example cases

 ■ Automatic classification of news and other media posts

    - Automatic categorization of continuously increasing news

    - Precise classification based on contents

 ■ Classification for rumor tracking

    - The rumor is classified in speedy manner

    - Filters out the false rumors in real-time

 ■ CRM Automatic classification

    - Automatic classification of customer complaints/suggestions

    - Delivery to most appropriate department

 ■ Automatic classification of scientific documents

    - Precise classification based on content

    - Faceted Navigation

 ■ Automatic classification of email

    - Classifies email based on content and puts in specified received folders

      (not based on simple keyword match)

    - Filters out the useful email from loads of email

    - Sophisticated spam filtering

 ■ Automatic classification for complaint management

    - Automatically classifies the complaints lodged for public organizations

    - Forwards the complaint to most appropriate department for further handling

    - Offers best and automatic reply for complaints


Main Functionalities

 ■ Taxonomy management

    - Taxonomy can be built manually or extracted from documents automatically

    - Category hierarchy can be built automatically with populated training documents

 ■ Learning based document classification

    - Automatically classify documents with state-of-the-art supervised machine learning algorithms

    - High performance on large scale documents

 ■ Rule based document classification

    - Support for rule definition, rule boolean operation

    - Automatic rule induction from documents

 ■ Search
   
- Support for category based search 

    - Support for incremental index update


System Architecture

 

2009/04/14 09:42 2009/04/14 09:42
Response
No Trackback , No Comment
RSS :
http://en.wisenut.com/rss/response/19

Search Formula-1

Background

 

In this information explosion era, the creation of data in internet, intranet or personal computers are increasing at gigantic rate every day. How to effectively organize and find the needed information from the mass data repositories has become a key for competitiveness in business and daily life.
A simple keyword matching search is far from enough to satisfy the information need. This era demands a much more intelligent search engine that could exploit the deep "knowledge" hidden in the sea of data to help users find information and make their decisions.


Overview

Search Formula-1(SF-1) is designed to give an integrated solution to find information in a variety of environments, including internet, intranet and personal computers. It is an intelligent search engine in that it does much data-mining analysis to guide the user describes their information need, assist the searching, ranking and presentation of the search result. The aim is to provide the information that best fits the users' need and help users to make informative decisions. At the same time, it is built to handle large amount of data in a very efficient manner with effective system resource utilization. The product has achieved highest market share and has proven its worth in multitude of fields. Having wide industrial experience in hand, we are well aware of your search needs and would deliver the best and optimized solution for your operating environment.


Special Features

 ■ Data mining support

    Provides a bundle of data mining components to assist making an intelligent search result. For example,
    it supports search result clustering, taxonomy navigation, duplicate document detection, text summarization,
    similar document search, query correction, query recommendation, automatic query completion, etc.

 ■ Quality search results

    Offers variety of effective sorting and ranking mechanisms. Apart from support of many built in state of art
    algorithms, functionality is provided for user to define its own sorting and ranking strategies.

 ■ Real-time update

    Information is being produced all the time. Access is also provided in real time. SF-1 could update the index
    in real time without any need for restart. Moreover this data is loaded in memory to give high speed experience.

 ■ Support of multitude of data formats

    SF-1 supports multiple platforms (Win 32, Sun-OS, Linux, HPUS, AIX) connects to multiple data sources
    (DBMS, WWW, XML, Notes, MS Ex-change etc.) and supports variety of data formats (word, PDF, text etc.).

 ■ High Scalability and platform independence

    Setup file is in universal XML standard which makes independent of any single platform and makes upgrade
    a simple process. SF-1 is built with 3-tier highly scalable architecture and supports multiple platforms thus
    ensuring that SF-1 supports your needs well into future.

 ■ Stable and efficient system architecture

    SF-1 is built for distributed processing and uses redundancy and index compression to make it fail safe and
    efficient. With use of multi processing and multi threading system processing resources are utilized to the
    fullest and index compression makes sure that disk utilization is optimized.

■ Intuitive and convenient management tool
    Power of system cannot be utilized fully if the management system is complicated and arcane. SF-1 provides
    intuitive management tool which makes managing a simple and convenient process. System monitoring,
    module management, statistics and dictionary management requires a few clicks only.

 

Supporting platforms and recommended system

Supported
Platforms
  Windows, Linux, Solaris, AIX, Unix...
  API Support : JAVA, COM, C/C++, ASP/.NET, PHP API
Recommended
Specifications
  CPU : Pentium 4 Xeon 2.4GHz x 2ea
  Memory : 4G Bytes
  Disk : 80G SCSI x 1ea
  OS : Windows Server 2003 Standard Edition

System Architecture

2009/04/13 16:10 2009/04/13 16:10
Response
No Trackback , No Comment
RSS :
http://en.wisenut.com/rss/response/18

WISE IF



Background

  Currently the users use crawlers to crawl web pages from the Internet and extractors to extract information
  they need. But no existing robots can the two together since the web pages are of various formats.
  While the need of a robot, who can crawl the web pages user need and extract accurate information from
  the pages crawled, is increasing.


Overview

  IF is such a system to meet the requirement. Through rule definition, IF can accurately know what kind of
  page to crawl and what information to extract. It's a powerful tool to help the user to find and collect accurate
  information.

Special Features

 

 ■ High performance and high quality

    - Gives best access to information and best content gathering functionality

    - Collects the data chosen by customer with precision

    - Can collect data from various sources including java script, certified pages and many other formats

 ■ Convenience of use

    - Convenient interface for management and use.

    - Combination of rule based and automatic collection process

    - Web based tool for Collection, Analysis and Storage

 ■ Stability

    - Stable and convenient system

    - Speedy processing of large scale data

    - Management of dead links to decrease the error in collection process


Main Features

 ■ Rule register

    Through IF, the user can decide what kind of page to crawl and information to extract. Rule register is
    an application with a web explorer built in, together with some other components. When browsing the web
    pages, the user can easily define the crawling rules, extracting rules and so on.

 ■ Crawler and extractor

    After the user define the rated rules, the crawler will crawl the pages according to the crawling rules and
    extractor will extract information according to the extracting rules. The information extracted will be saved
    into the database in a format pre-defined.

 ■ Web management tool

    A web tool to view statistic information of crawling tasks and so on.

 ■ Exporter

    The user can export the information from the database to a file or another database in the format defined.


System Architecture

 

2009/04/06 18:15 2009/04/06 18:15
Response
No Trackback , No Comment
RSS :
http://en.wisenut.com/rss/response/17