Paper Presentation and Final Report on Compounded Uniqueness Level: Geo-Location Indexing Using Address Parser.
Abstract on this project :- Geo-location searching is an important feature for any search engine and research in this field is not new. The only issue that remains is how a search engine know whether a web page belongs to India or the USA? URLs ending with [.in] are the ultimate choice for India but not all web sites from India end with [.in]. This paper describes a technology known as the address parser.
The address parser searches for patterns in a web page that communicates address information. The address parser does not parse every web page of a website for extracting the address but only works on those URLs where the probability of finding an address of the website owner is maximum, thereby eliminating false positives. A central knowledge base is built manually, which contains information like States of a country followed by their city names and other relevant information that may help the address parser do precise local indexing. It was observed that the address parser was not only able to recognize the address patterns in the web pages but also indexed them to city specific information. As a result, a person located in Gangtok, Sikkim, India searched for [universities]; the searching module showed the link of [Sikkim Manipal University] first, followed by other links from India.
You can also Subscribe to PROJECTSWORLDS by Email for more such projects and seminar topics.
Keywords: Address Parser, Geo-location Indexing, Information Retrieval, Localized Searching.
For more updates on Projects via E-mail or Sms Subscribe to www.projectsworlds.blogspot.com