An active crawler for discovering geospatial Web services and their distribution pattern – A case study of OGC Web Map Service
2010; Taylor & Francis; Volume: 24; Issue: 8; Language: English
DOI: 10.1080/13658810903514172
ISSN: 1365-8824
Authors: Wenwen Li, Chaowei Yang, Chongjun Yang
Topic(s): Web Data Mining and Analysis
Abstract: The increased popularity of standards for geospatial interoperability has led to an increasing number of geospatial Web services (GWSs), such as Web Map Services (WMSs), becoming publicly available on the Internet. However, finding these services quickly and precisely is still a challenge. Traditional methods collect the services through centralized registries, where services can be manually registered, but the metadata of the registered services cannot be updated in a timely manner. This paper addresses these challenges by developing an effective crawler to discover and update the services by (1) proposing an accumulated term frequency (ATF)-based conditional probability model for prioritized crawling, (2) using a concurrent multi-threading technique, and (3) adopting an automatic mechanism to update the metadata of identified services. Experiments show that the proposed crawler achieves good performance in both crawling efficiency and results' coverage/liveliness. In addition, an interesting finding regarding the distribution pattern of WMSs is discussed. We expect this research to contribute to automatic GWS discovery over the large-scale and dynamic World Wide Web and to the promotion of operational, interoperable, distributed geospatial services.
Keywords: geospatial Web service (GWS); crawler; Web Map Service (WMS); accumulated term frequency (ATF); conditional probability; clumped distribution
Notes
1. Refractions Research (RR)'s white paper. http://refractions.net/expertise/whitepapers/ogcsurvey/ogcsurvey/
2. Skylab Mobilesystems' WMS list. http://ogc-services.net/
3. GIDB WMS Service list: http://columbo.nrlssc.navy.mil/ogcwms/servlet/WMSServlet?REQUEST=ServiceLinks
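The idea of prioritized crawling mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the paper's ATF-based conditional probability model, which the abstract does not specify. It only assumes that a URL whose address and link text contain more service-related terms should be visited earlier. The term list `HINT_TERMS`, the function `term_score`, and the class `Frontier` are hypothetical names introduced here for illustration.

```python
import heapq
import re
from collections import Counter

# Hypothetical hint terms chosen for illustration; not taken from the paper.
HINT_TERMS = ("wms", "getcapabilities", "ogc", "map", "service")


def term_score(text, terms=HINT_TERMS):
    """Count how often the hint terms occur in text (case-insensitive)."""
    counts = Counter(re.findall(r"[a-z]+", text.lower()))
    return sum(counts[t] for t in terms)


class Frontier:
    """Crawl frontier that returns the highest-scoring unseen URL first."""

    def __init__(self):
        self._heap = []      # entries are (-score, insertion order, url)
        self._seen = set()   # URLs already queued, to avoid duplicates
        self._order = 0      # tie-breaker so equal scores pop in FIFO order

    def push(self, url, context=""):
        """Queue url, scored on the URL plus its link text; skip duplicates."""
        if url in self._seen:
            return False
        self._seen.add(url)
        score = term_score(url + " " + context)
        heapq.heappush(self._heap, (-score, self._order, url))
        self._order += 1
        return True

    def pop(self):
        """Return (url, score) for the most promising queued URL."""
        neg_score, _, url = heapq.heappop(self._heap)
        return url, -neg_score

    def __len__(self):
        return len(self._heap)
```

In a full crawler, several worker threads would pop from such a frontier concurrently (which would need a lock around `push` and `pop`) and would test each candidate with a WMS `GetCapabilities` request. Both steps are left out of this sketch.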