Show simple item record

A hybrid scavenger grid approach to intranet search

dc.contributor.advisorSuleman, Husseinen_ZA
dc.contributor.authorNakashole, Ndapandulaen_ZA
dc.date.accessioned2014-10-30T13:54:10Z
dc.date.accessioned2018-11-26T13:53:13Z
dc.date.available2014-10-30T13:54:10Z
dc.date.available2018-11-26T13:53:13Z
dc.date.issued2009en_ZA
dc.identifier.urihttp://hdl.handle.net/11427/8977
dc.identifier.urihttp://repository.aust.edu.ng/xmlui/handle/11427/8977
dc.descriptionIncludes bibliographical references (p. 93-98).en_ZA
dc.description.abstractAccording to a 2007 global survey of 178 organisational intranets, 3 out of 5 organisations are not satisfied with their intranet search services. However, as intranet data collections become large, effective full-text intranet search services are needed more than ever before. To provide an effective full-text search service based on current information retrieval algorithms, organisations have to deal with the need for greater computational power. Hardware architectures that can scale to large data collections and can be obtained and maintained at a reasonable cost are needed. Web search engines address scalability and cost-effectiveness by using large-scale centralised cluster architectures. The scalability of cluster architectures is evident in the ability of Web search engines to respond to millions of queries within a few seconds while searching very large data collections. Though more cost-effective than high-end supercomputers, cluster architectures still have relatively high acquisition and maintenance costs. Where information retrieval is not the core business of an organisation, a cluster-based approach may not be economically viable. A hybrid scavenger grid is proposed as an alternative architecture - it consists of a combination of dedicated and dynamic resources in the form of idle desktop workstations. From the dedicated resources, the architecture gets predictability and reliability whereas from the dynamic resources it gets scalability. An experimental search engine was deployed on a hybrid scavenger grid and evaluated. Test results showed that the resources of the grid can be organised to deliver the best performance by using the optimal number of machines and scheduling the optimal combination of tasks that the machines perform. A system-efficiency and cost-effectiveness comparison of a grid and a multi-core machine showed that for workloads of modest to large sizes, the grid architecture delivers better throughput per unit cost than the multi-core, at a system efficiency that is comparable to that of the multi-core. The study has shown that a hybrid scavenger grid is a feasible search engine architecture that is cost-effective and scales to medium- to large-scale data collections.en_ZA
dc.language.isoengen_ZA
dc.subject.otherComputer Scienceen_ZA
dc.titleA hybrid scavenger grid approach to intranet searchen_ZA
dc.typeThesisen_ZA
dc.type.qualificationlevelMastersen_ZA
dc.type.qualificationnameMScen_ZA
dc.publisher.institutionUniversity of Cape Town
dc.publisher.facultyFaculty of Scienceen_ZA
dc.publisher.departmentDepartment of Computer Scienceen_ZA


Files in this item

FilesSizeFormatView
thesis_sci_2009_nakashole_n.pdf5.574Mbapplication/pdfView/Open

This item appears in the following Collection(s)

Show simple item record