Data mining - CompWisdom
About us  |  Why use us?  |  Press  |  Contact us

 

Topic: Data mining


  
 Data mining - Wikipedia, the free encyclopedia
However, Data Mining applies many older computational techniques from statistics, machine learning and pattern recognition.
Data Mining is a fairly recent and contemporary topic in computing.
Used in the technical context of data warehousing and analysis, the term "data mining" is neutral.
http://en.wikipedia.org/wiki/Data_mining   (1768 words)

  
 An Introduction to Data Mining
Data mining techniques can be implemented rapidly on existing software and hardware platforms to enhance the value of existing information resources, and can be integrated with new products and systems as they are brought on-line.
Data mining techniques can yield the benefits of automation on existing software and hardware platforms, and can be implemented on new systems as existing platforms are upgraded and new products developed.
Data mining, the extraction of hidden predictive information from large databases, is a powerful new technology with great potential to help companies focus on the most important information in their data warehouses.
http://www.thearling.com/text/dmwhite/dmwhite.htm   (3639 words)

  
 Two Crows: Data mining glossary
For example, a data mining software system may have an API which permits user-written programs to perform such tasks as extract data, perform additional statistical analysis, create specialized charts, generate a model, or make a prediction from a model.
Data not collected by the organization, such as data from a proprietary database, that is combined with the organization's own data.
A positive function of the difference between predictions and data estimates that are chosen so as to optimize the function or criterion.
http://www.twocrows.com/glossary.htm   (3689 words)

  
 Data Mining Software, Data Mining Applications and Data Mining Solutions
Because data mining tools are so flexible, a set of data mining guidelines and a data mining methodology have been developed to help guide the process.
Data mining tools provide a number of techniques that can be applied to any business problem.
Increasingly, organizations are using data mining tools and data mining applications together in an integrated environment for predictive analytics.
http://www.spss.com/datamine   (592 words)

  
 Data Mining Techniques
Data Mining is often considered to be "a blend of statistics, AI [artificial intelligence], and data base research" (Pregibon, 1997, p.
The concept of deployment in predictive data mining refers to the application of a model for prediction or classification to new data.
Machine learning, computational learning theory, and similar terms are often used in the context of Data Mining, to denote the application of generic model-fitting or classification algorithms for predictive data mining.
http://www.statsoft.com/textbook/stdatmin.html   (4347 words)

  
 What is Data Mining
While data mining does not eliminate human participation in solving the task completely, it significantly simplifies the job and allows an analyst who is not a professional in statistics and programming to manage the process of extracting knowledge from data.
Modern computer data mining systems self learn from the previous history of the investigated system, formulating and testing hypotheses about the rules which this system obeys.
Two other problems that surface when human analysts process data are the inadequacy of the human brain when searching for complex multifactor dependencies in data, and the lack of objectiveness in such an analysis.
http://www.megaputer.com/dm/dm101.php3   (1144 words)

  
 Data Mining and Discovery
Data Mining and Knowledge Discovery, a Springer Computing Methodologies Journal.
Data mining is an AI powered tool that can discover useful information within a database that can then be used to improve actions.
First applied in banking, data mining uses a variety of algorithms to sift through storehouses of data in search of 'noisy' patterns and relationships among the different silos of information.
http://www.aaai.org/AITopics/html/mining.html   (3275 words)

  
 Data Mining Software Guide to Data Mining Software
Data Mining comes in a variety shapes and forms depending on primarily the size of databases that have to be analyzed.
Excel, with its advanced data functions, is adequate for small companies and Access for data storage.
This is especially true when data is purchased from external vendors.
http://www.data-mining-guide.net   (379 words)

  
 CCSU - Data Mining
Data Mining was listed by the MIT Technology Review as one of ten technologies that will change the world.
“Data mining is the analysis of (often large) observational data sets to find unsuspected relationships and to summarize the data in novel ways that are both understandable and useful to the data owner.”
He is developing a course on data mining for genomics and proteomics, which he will teach online in Spring 2006.
http://www.ccsu.edu/datamining   (307 words)

  
 Benchmarking- Data Mining Benchmarking Association
To create a cooperative environment where full understanding of the performance and enablers of "best in class" data mining management processes can be obtained and shared at reasonable cost.
To support the use of benchmarking to facilitate data mining process improvement and the achievement of accuracy, timeliness and efficiency.
To use the efficiency of the association to obtain process performance data and related best practices from regarding data mining.
http://www.dmbenchmarking.com   (523 words)

  
 Welcome
The National Center for Data Mining (NCDM) at the University of Illinois at Chicago (UIC) was established in 1998 to serve as a resource for research, standards development, and outreach for high performance and distributed data mining and predictive modeling.
The NCDM is a co-founding member of the Data Mining Group (DMG), which develops the Predictive Model Markup Language (PMML) and related standards, runs two data mining testbeds (the Terabyte Challenge and the Terra Wide Data Mining Testbed), and has an active outreach program.
Developing algorithms, applications, and systems for mining distributed data.
http://www.ncdm.uic.edu   (326 words)

  
 Investor Home - Data Mining
The practice of data mining in and of itself is neither good nor bad and the use of data mining has become common in many industries.
The article discussed data mining, Michael Drosnin’s book The Bible Code (much more on this topic later), and the fact that patterns will occur in data by pure chance, particularly if you consider many factors.
The article argues that given a finite amount of historical data and an infinite number of complex models, uninformed investors might be lured into "overfitting" the data.
http://www.investorhome.com/mining.htm   (2507 words)

  
 Untangling Text Data Mining
Although this can be viewed as a standard classification task (where the class is a binary assignment to the new-event class) it is more in the spirit of data mining, in that the focus is on discovery of the beginning of a new theme or trend.
In this paper I will first define data mining, information access, and corpus-based computational linguistics, and then discuss the relationship of these to text data mining.
It is important to differentiate between text data mining and information access (or information retrieval, as it is more widely known).
http://www.sims.berkeley.edu/~hearst/papers/acl99/acl99-tdm.html   (3940 words)

  
 Data mining [OCLC - Projects]
However, this data needs to be made to work harder in order to create value for librarians and users.
If libraries are to realize the full value of their bibliographic data—or, put another way, if libraries are to maximize the return on the investments they make to create this data—steps must be taken to release this value in innovative and useful ways.
OCLC Research has a number of projects currently underway in the Data-Mining Research Area, with plans for several future projects as well.
http://www.oclc.org/research/projects/mining   (604 words)

  
 DATA MINING 2005
The Sixth International Conference on Data Mining, Text Mining and their Business Applications (Data Mining/05) was held on the Island of Skiathos, Greece, organised by the Wessex Institute of Technology and the Federal University of Rio do Janeiro.
Interest in unstructured data mining and text mining will grow among researchers, OEM and system integrators working in sectors such as information retrieval, semantic web, linguistics and knowledge management.
The proceedings of Data Mining VI: Data Mining, Text Mining and their Business Applications, 568pp (ISBN: 1-84564-016-0) are available in hard back from WIT Press priced at £199/US$318/€298.50.
http://www.wessex.ac.uk/conferences/2005/data05   (573 words)

  
 Statistical Data Mining Tutorials
The Decision Tree is one of the most popular classification algorithms in current use in Data Mining and Machine Learning.
If you're new to data mining you'll enjoy it, but your eyebrows will raise at how simple it all is! After having defined the job of classification, we explain how information gain (next Andrew Tutorial) can be used to find predictive input attributes.
This short and simple tutorial overviews the problem of learning Bayesian networks from data, and the approaches that are used.
http://www.autonlab.org/tutorials   (3003 words)

  
 Wired News: Why Data Mining Won't Stop Terror
Data mining works best when you're searching for a well-defined profile, a reasonable number of attacks per year and a low cost of false alarms.
The basic idea was as audacious as it was repellent: suck up as much data as possible about everyone, sift through it with massive computers, and investigate patterns that might indicate terrorist plots.
But even in the most wildly optimistic projections, data mining isn't tenable for that purpose.
http://www.wired.com/news/columns/0,70357-0.html?tw=rss.index   (711 words)

  
 NCBI Tools for Bioinformatics Research
It compares the query sequence against data in NCBI's UniSTS, a unified, non-redundant view of STSs from a wide range of sources.
When possible, the information includes results of analyses that have been done on the sequence data.
The eUtils use a fixed URL syntax that translates a standard set of input parameters into values necessary for various NCBI software components to search for and retrieve data from 23 Entrez databases.
http://www.ncbi.nih.gov/Tools   (1672 words)

  
 ONLamp.com -- Data Mining Email
Many of the tables have a column of type oid, which refers to the actual data that is located in the system catalog pg_largeobject.
This article is for those who need a guide to generating information from existing data and are looking for ideas on how to do it.
The tables have been defined such that no data population can occur unless the messageid already exists in the table mailid.
http://www.onlamp.com/pub/a/onlamp/2004/04/08/datamining_email.html   (836 words)

  
 IBM Research IBM Research Knowledge Discovery & Data Mining
The challenge of extracting knowledge from data draws upon research in statistics, databases, pattern recognition, machine learning, data visualization, optimization, and high-performance computing, to deliver advanced business intelligence and web discovery solutions.
Knowledge Discovery and Data Mining (KDD) is an interdisciplinary area focusing upon methodologies for extracting useful knowledge from data.
The ongoing rapid growth of online data due to the Internet and the widespread use of databases have created an immense need for KDD methodologies.
http://domino.research.ibm.com/comm/research.nsf/pages/r.kdd.html   (205 words)

  
 KDnuggets: Data Mining, Web Mining, and Knowledge Discovery Guide
Also Data Mining Forums for Beginners, Experts, Classification and Clustering, Applications,...
KDnuggets News, Data Mining and Knowledge Discovery newsletter: data mining news, jobs, software, courses, and more (Free Subscription).
Data Mining, Web Mining, Text Mining, and Knowledge Discovery
http://www.kdnuggets.com   (97 words)

  
 Data Mining - Home Page (Misc)
The Data Mine was launched in April 1994, to provide information about Data Mining also known as Knowledge Discovery In Databases (KDD) or simply Knowledge Discovery.
You could also start with the Introduction To Data Mining.
Popular pages include: Data Mining Books And Papers,OnLine Analytical Processing (OLAP), Data Mining Journals, Data Mining Tutorials, Data Sources.
http://www.the-data-mine.com   (452 words)

  
 SQL Server Data Mining
Data Mining - Scalar Mining Structure Column Data type error...
Data Mining Managed Plug-in Algorithm API Tutorial (Tutorial)
It allows you to play with various parameters of the Microsoft_Clustering algorithm and gain an understanding of how it works.
http://www.sqlserverdatamining.com   (292 words)

  
 Predictive Data Mining and Text Mining Software
The objective is to determine the best set of rules for prediction and classification, where best is the smallest number of rules with a near-minimum error.
is a comprehensive collection of programs for efficient mining of big data.
is a comprehensive software package for predictive text mining.
http://www.data-miner.com   (343 words)

  
 Data Mining
I'm not going to do a deep dive on the data visualization, but just wanted to comment on the methodology used to extract feelings.
There were a couple of papers that looked at various social network analyses of blog data.
Tseng's paper also showed how, given a discovered community, the topicality of that community could be discovered by mining keywords as in this example:
http://datamining.typepad.com/data_mining   (3031 words)

  
 US plans massive data sweep csmonitor.com
The US government is developing a massive computer system that can collect huge amounts of data and, by linking far-flung information from blogs and e-mail to government records and intelligence reports, search for patterns of terrorist activity.
For example: With name and Social Security number stripped from their files, 87 percent of Americans can be identified simply by knowing their date of birth, gender, and five-digit Zip code, according to research by Latanya Sweeney, a data-privacy researcher at Carnegie Mellon University.
Viewing data in this way could reveal patterns not obvious in text or number form.
http://www.csmonitor.com/2006/0209/p01s02-uspo.html   (1858 words)

  
 Data Mining: Practical Machine Learning Tools and Techniques
Helps you select appropriate approaches to particular problems and to compare and evaluate the results of different techniques.
Data Mining: Practical Machine Learning Tools and Techniques
"If you have data that you want to analyze and understand, this book and the associated Weka toolkit are an excellent way to start."
http://www.cs.waikato.ac.nz/~ml/weka/book.html   (143 words)

  
 Two Crows data mining home page
Learn the basics of data mining from our popular tutorial booklet, "Introduction to Data Mining and Knowledge Discovery, Third Edition" (PDF).
Two Crows has helped users in a wide range of organizations analyze their needs, select the right tools, and implement data mining projects successfully.
This exciting technology uses sophisticated statistical analysis and modeling techniques to uncover predictive patterns and relationships hidden in organizational databases &; patterns that ordinary methods might miss.
http://www.twocrows.com   (115 words)

  
 Business intelligence, data warehousing and analytics editorial from DMReview
Columns focus on meta data, enterprise architecture, business intelligence, high availability, data warehousing and more.
Multiple, distinct physical models based on a single logical model give you the tools you need to manage complex database environments and critical metadata in an intuitive user interface.
Welcome to DMReview.com, home of DM Review magazine, the premier publication for business intelligence, analytics, integration and data warehousing.
http://www.dmreview.com   (316 words)

  
 URL's for Data Mining
An Overview of Data Mining at Dun & Bradstreet
UCLA Data Mining Lab EOSDIS Project Home Page v0
Integral Solutions - Data Mining, KDD, AI, KBS & ES
http://www.galaxy.gmu.edu/stats/syllabi/DMLIST.html   (106 words)

  
 Data Mining Lecture Notes
Tsur et al., ``Query Flocks: A Generalization of Association-Rule Mining,'' 1998 SIGMOD.
Park, M.-S. Chen, and P. Yu, ``An Effective Hash-Based Algorithm for Mining Association Rules,'' 1995 SIGMOD, pp.
Agrawal, R. Srikant: ``Fast Algorithms for Mining Association Rules'', Proc.
http://www-db.stanford.edu/~ullman/mining/mining.html   (496 words)

  
 Data Mining Resources
Data mining and information retrieval in the World Wide Web
Publications related to data mining by Heikki Mannila
Tutorial on High Performance Data Mining: Vipin Kumar and Mahesh Joshi
http://www.cisl.ucar.edu/hps/GROUPS/dm/dm.html   (830 words)

  
 Software for Data Mining and Knowledge Discovery
This is a directory of general-purpose data mining software.
Web Searching: search engines, crawlers, and similar software.
Text Analysis, Text Mining, and Information Retrieval (IR)
http://www.kdnuggets.com/software   (47 words)

  
 Data Miners Home Page
The application of survival analysis to time-to-event problems in the business world (churn, for example) has us excited.
One of many books by Data Miners staff.
Data Mining Techniques Second Edition is now in stores.
http://www.data-miners.com   (35 words)

  
 Data Mining Student Notes, QUB
1.2.4 - Differences between Data Mining and Machine Learning
http://www.pcc.qub.ac.uk/tec/courses/datamining/stu_notes/dm_book_1.html   (72 words)

  
 DMG
The Data Mining Group (DMG) is an independent, vendor led group which develops data mining standards, such as the Predictive Model Markup Language (PMML).
Open Data, River Forest, IL Oracle Corporation, Redwood Shores, CA Contact: Peter Stengard, Oracle Data Mining Technologies
KDD-2006 Workshop on Data Mining Standards, Services and Platforms (DM-SSP 06).
http://www.dmg.org   (204 words)

  
 IBM Research Almaden Research Center Computer Science
Quest) group is designing information systems that preserve the privacy and ownership of data while not impeding the flow of information.
Our work is motivated by the technical challenges posed by the emerging 'On Demand' world whose success is predicated on protecting the privacy, security, and integrity of interactions between individuals and enterprises as well as between enterprises.
http://www.almaden.ibm.com/cs/quest   (121 words)

  
 Data Mining - Home
A new and improved Data Mining Technologies Website
You can contact us by phone at :
http://www.data-mine.com   (25 words)

  
 Deep Market Advanced Stock Market Analysis
That is a couple of data points, so let’s go with the magnitude of “over 7 billion tons” for our back of the envelope calculations.
The process describes any number of technologies for capturing and storing CO2 so that it is not emitted into the atmosphere.
Well, we have a general idea of what carbon is out there that would be great to capture, but sequestering seems to be a very small part of the solution (but part of the solution!!):
http://www.deepmarket.com   (2142 words)

  
 CCSU - Data Mining
Data Mining Research Group at University of East Anglia
http://www.ccsu.edu/datamining/resources.html   (52 words)

Compwisdom
 About us   |  Why use us?   |  Press   |  Contact us

 Copyright © 2006 CompWisdom.com Usage implies agreement with terms.