Help   About ProQuest | 

Dissertations & Theses
The world's most comprehensive collection of dissertations and theses.Learn More...

Citation/Abstract

Print  |  Email  |  Order a Copy  
Extracting trends in high-dimensional datasets
by Pokharkar, Snehal, M.S., Wayne State University, 2009, 86 pages; AAT 1461996

Abstract (Summary)

High-dimensional data analysis is an important research area in today's world, due to the rapid growth in the amount of data collected. To that end, this thesis seeks an information- revealing representation for high-dimensional data distributions that may contain local trends in certain subspaces. Examples are data that have continuous support in simple shapes with identifiable branches. Such data can be represented by a graph that consists of segments of locally fit principal curves or surfaces summarizing each identifiable branch. This thesis describes a new algorithm to find the optimal paths through such a principal graph. The paths are optimal in the sense that they represent the longest smooth trends through the data set, and jointly they cover the data set entirely with minimum overlap. The algorithm is suitable for hypothesizing trends in high-dimensional data, and can assist exploratory data analysis and visualization. Additionally, another algorithm called IRST which identifies Information Rich Subsets of High-Dimensional data and extracts the order based Subspace Trends present in them is also developed in this thesis. The notion of Trends, the implementation details, the complexities and analysis along with results on synthetic and real world sample datasets are described.

Indexing (document details)

Advisor:Reddy, Chandan K.
Committee members:Schwiebert, Loren,  Jamil, Hasan
School:Wayne State University
Department:Computer Science
School Location:United States -- Michigan
Keyword(s):Data mining, Exploratory data analysis, Trends
Source:MAI 47/04, Aug 2009
Source type:Dissertation
Subjects:Computer science
Publication Number: AAT 1461996
Document URL:http://proquest.umi.com/pqdlink?did=1674964271&Fmt=7&clientI d=79356&RQT=309&VName=PQD
ProQuest document ID:1674964271


 

 » Purchase the full text

Dissertations and theses can be purchased in a variety of formats which may include: PDF for web download, softcover, hardcover, or microform. Click the "Order a Copy" button to see the formats available for this item.

Available without purchase:

Preview  Preview

Print  |  Email  |  Order a Copy  
^Back to Top
Copyright © 2009 ProQuest LLC. All rights reserved. Terms and Conditions