The concept “textual entailment” is a new approach that is applied in the natural language processing field. It is used to indicate the state in which the semantics of one natural language written text can be inferred from the semantics of another text. Specifically, if the truth of a text segment entails the truth of another text segment. For example, given the texts:
1. For their discovery of ulcer-causing bacteria, Australian doctors Robin Warren and Barry Marshall have received the 2005 Nobel Prize in Physiology or Medicine.
2. Robin Warren was awarded a Nobel Prize.
Monday, February 23, 2009
Sunday, February 22, 2009
P2P Web Search Engine.
ABSTRACT
The fast development of the World Wide Web and Dynamic nature makes it a challenge for searching and retrieving of information that is more recent. The WWW is a rapidly growing and changing information source. Its growth and change rates make the task of finding recent information harder.
The exponentially growing information published on the Web relies largely on a few major search engines like Google to be brought to the public nowadays. However, such an engine is not suitable for fresh information retrieval because it spends a long time to collect web pages by using a web robot (or crawler).
In the other hand, there are some distributed search engines such as Cooperative Search Engine (CSE), and so on. However, these distributed search engines are able to update in a very short time, e.g. a few minutes. But it has more communication delay.
In this project, in order to reduce the delay we proposed a P2P Web search that connects an a-priori unlimited number of peers, each of which maintains a personal local database and a local search facility.
Each peer posts a small amount of metadata to a physically distributed directory layered on top of a DHT-based overlay network that is used to efficiently select promising peers from across the peer population that can best locally execute a query.
Further, usage of Textual Entailment approach helps in searching the recent document for a given keyword.
The fast development of the World Wide Web and Dynamic nature makes it a challenge for searching and retrieving of information that is more recent. The WWW is a rapidly growing and changing information source. Its growth and change rates make the task of finding recent information harder.
The exponentially growing information published on the Web relies largely on a few major search engines like Google to be brought to the public nowadays. However, such an engine is not suitable for fresh information retrieval because it spends a long time to collect web pages by using a web robot (or crawler).
In the other hand, there are some distributed search engines such as Cooperative Search Engine (CSE), and so on. However, these distributed search engines are able to update in a very short time, e.g. a few minutes. But it has more communication delay.
In this project, in order to reduce the delay we proposed a P2P Web search that connects an a-priori unlimited number of peers, each of which maintains a personal local database and a local search facility.
Each peer posts a small amount of metadata to a physically distributed directory layered on top of a DHT-based overlay network that is used to efficiently select promising peers from across the peer population that can best locally execute a query.
Further, usage of Textual Entailment approach helps in searching the recent document for a given keyword.
Subscribe to:
Posts (Atom)