Index Structure For Metadata Extracted From Large Hypertext Collections

dc.contributor.author: Pathak, Achyut Pd.
dc.date.accessioned: 2021-12-23T04:10:26Z
dc.date.available: 2021-12-23T04:10:26Z
dc.date.issued: 2008
dc.description.abstract: A growing amount of hypertext data can be found in contexts such as weblogs and online journals, intranet webs, the World Wide Web (WWW), online communities, intra-organizational wikis, and other collaborative content management platforms. In such collections, the combination of content and hyperlink structure reveals information about phenomena such as the existence of cyber communities, the documents similar to a given document, the popularity and importance of documents, and the probability of reaching a document from any other document by following a sequence of hyperlinks. All of these can be determined by analyzing a hypertext web, so different kinds of analysis can be performed on hypertext collections. Such analysis requires locating information in the collection, which in turn requires an index. Because a hypertext database is large, an efficient index structure is needed. Keys are used both to construct the index and to search it, and the URLs of web pages serve as the keys for hypertext collections. Since URLs vary in length, the index must support variable-length keys. To meet these requirements, a multilevel index supporting variable-length keys has been constructed as an index for hypertext collections.
dc.identifier.uri: https://hdl.handle.net/20.500.14540/6606
dc.language.iso: en_US
dc.publisher: Department of Computer Science
dc.subject: Index Structure
dc.subject: Hypertext Collections
dc.title: Index Structure For Metadata Extracted From Large Hypertext Collections
dc.type: Thesis
local.academic.level: Masters
local.institute.title: Central Department of Computer Science and Information Technology
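
The abstract describes a multilevel index keyed on variable-length URLs. The record does not give the thesis's actual design, so the following is only a minimal sketch of the general idea: a two-level sparse index in which sorted URL keys are packed into blocks and an upper level routes each lookup to one block. The class name `TwoLevelIndex`, the `block_size` parameter, and the metadata values are all hypothetical.

```python
from bisect import bisect_right


class TwoLevelIndex:
    """Hypothetical two-level sparse index over variable-length string keys."""

    def __init__(self, records, block_size=4):
        # records: dict mapping URL -> metadata (assumed shape, not from the thesis).
        items = sorted(records.items())
        # Lower level: sorted (url, metadata) pairs packed into blocks of block_size.
        self.blocks = [items[i:i + block_size]
                       for i in range(0, len(items), block_size)]
        # Upper level: the first key of each block, used to route a lookup.
        self.separators = [block[0][0] for block in self.blocks]

    def lookup(self, url):
        # Find the last block whose first key is <= url.
        pos = bisect_right(self.separators, url) - 1
        if pos < 0:
            return None
        # Binary search inside that block for an exact match.
        keys = [key for key, _ in self.blocks[pos]]
        i = bisect_right(keys, url) - 1
        if i >= 0 and keys[i] == url:
            return self.blocks[pos][i][1]
        return None
```

Because the keys are ordinary strings compared lexicographically, URLs of any length can be mixed freely; an on-disk variant would additionally need a length-prefixed or offset-based key layout, which this sketch does not attempt.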

Files

Original bundle

Name: THESIS.pdf
Size: 317.01 KB
Format: Adobe Portable Document Format

License bundle

Name: license.txt
Size: 1.71 KB
Format: Item-specific license agreed upon to submission