Data mining (DM), also known as Knowledge-Discovery in Databases (KDD) or Knowledge-Discovery and Data Mining (KDD), is the process of automatically searching large volumes of data for patterns. Data mining is a fairly recent and contemporary topic in computer science. However, Data mining applies many older computational techniques from statistics, information retrieval, machine learning and pattern recognition.
Definition
Data mining can be defined as "the nontrivial extraction of implicit, previously unknown, and potentially useful information from data" W. Frawley and G. Piatetsky-Shapiro and C. Matheus, Knowledge Discovery in Databases: An Overview. AI Magazine, Fall 1992, pp. 213-228. and "the science of extracting useful information from large data sets or databases" D. Hand, H. Mannila, P. Smyth: Principles of Data Mining. MIT Press, Cambridge, MA, 2001. ISBN 0-262-08290-X. Although it is usually used in relation to analysis of data, data mining, like artificial intelligence, is an umbrella term and is used with varied meaning in a wide range of contexts. It is usually associated with a business or other organization's need to identify trends.
Data mining involves the process of analysing data to show patterns or relationships; sorting through large amounts of data; and picking out pieces of relative information or patterns that occur e.g., picking out statistical information from some data.
70-643: Windows Server 2008 Applications Infrastructure Configuration, Package Microsoft Official Academic Course Tue, 15 Jul 2008 04:00:00 -0000 Exam 70-643, Windows Server 2008 Applications Platform Configuration. The newest iteration of the Microsoft Official Academic Course (MOAC) program for network administration courses using Windows Server 2008 and mapping to the Microsoft Certified Technology Specialist (MCTS) 70-643 certification exam. The MOAC IT Professional series is the Official from Microsoft, turn-key Workforce training program that leads to professional certification Read More... CISSP: Certified Information Systems Security Professional Study Guide, 4th Edition James Michael Stewart, Ed Tittel, Mike Chapple Tue, 08 Jul 2008 04:00:00 -0000 Building on the popular Sybex Study Guide approach, this book provides 100% coverage of the CISSP Body of Knowledge exam objectives. You'll find: Read More... MCTS: Windows Server 2008 Network Infrastructure Configuration: Exam 70-642 William Panek, Tylor Wentworth Tue, 08 Jul 2008 04:00:00 -0000 Get ready for the new Windows Server 2008 certification track. With Microsoft's release of Windows Server 2008 and a new generation of certification exams, network administrators have more reason than ever to certify their expertise in the world's leading server software. Inside, find the full coverage you need to prepare for Exam 70-642: Windows Server 2008 Network Infrastructure, Configuring, Read More... Shakespeare on the Double! A Midsummer Night's Dream William Shakespeare, Mary Ellen Snodgrass Mon, 23 Jun 2008 04:00:00 -0000 "The course of true love never did run smooth." Read More... Shakespeare on the Double! Twelfth Night William Shakespeare, Mary Ellen Snodgrass Mon, 23 Jun 2008 04:00:00 -0000 "O Time, thou must untangle this, not I! Read More... Shakespeare on the Double! The Taming of the Shrew William Shakespeare, Mary Ellen Snodgrass Mon, 23 Jun 2008 04:00:00 -0000 "I am ashamed that women are so simpleTo offer war where they should kneel for peace, Or seek for rule, supremancy, and sway Read More...
Goldridge Strategic Landscapes - Visualize complex market relationships based on real-time market data contained in Goldridge databases to which clients subscribe.
KD Nuggets Tool Links - Tools for Data Mining and Knowledge Discovery. Comprehensive list of tools with Internet Links.
Knowledge Discovery In Databases: Tools and Techniques - Article by Peggy Wright that presents the results of a literature survey outlining the state-of-the-art in KDD techniques and tools.
Meta Description: [ Knowledge discovery in databases (KDD) is
the field evolving to provide automated analysis solutions; ACM Crossroads 5-2 ]
Knowmadic Inc. - Provides web automation solutions by 'driving' the browser. Applications in B2B integration, web automation, content aggregation, data warehousing, and data mining browser helpers.
Megaputer Intelligence - Data, text, and web mining software. PolyAnalyst includes in-place mining, strong Microsoft integration.
Meta Description: [ Megaputer offers data mining, text mining, and web data mining software tools for e-commerce, database marketing, and CRM; seminars, training and consulting on data mining. ]
mine - Dorian Pyle, author of Data Preparation for Data Mining, provides resources on data mining, business modeling, and analytical CRM, including: articles, White Papers, downloads, books, information on courses and consulting, extensive links, and FAQs for mining pro's and newbies.
Meta Description: [ Model + Mine is Dorian Pyle's website, devoted to data mining, modeling and analytical CRM. Dorian wrote the industry-standard Data Preparation for Data Mining. ]
Mining Customer Data - By Gary Saarenvirta. A step-by-step look at a powerful clustering and segmentation methodology.
Meta Description: [ DB2 Magazine is devoted to DB2 and targeted at database administrators, analysts, programmers, designers, consultants, and MIS/DP managers. DB2 Magazine covers a range of topics for all DB2 platforms (including IBM AIX, Hewlett-Packard HP-UX, Sun Solaris, SCO UnixWare, Linux, Microsoft Windows NT... ]
Net Perceptions - Real-time relationship marketing and personalization, integrating high-scale data mining, analytic, and recommendation technologies with a direct conduit to action.
Meta Description: [ Welcome to NetPerceptions! ]
Nonlinear Thinking - A thought process that offers simple solutions within complexity reducing the dependence on experts, consultants and external resources. Features articles and tips for discovering novel solutions to recurring problems.
Psybertron Knowledge Modelling Weblog - What, Why and How do we Know ? Research into models for knowledge management in business organisation decision support. (Supersedes Ian's Knowledge Modelling Weblog)
Meta Description: [ A weblog of knowledge modelling references and discussions. ]
Second Moment - The news and business resource for applied analytics. Powerful content weblog mixing articles, commentary, technique and critique of the intersection of academic KD research and the directed KD of corporations.
Meta Description: [ Second Moment is a dynamic meeting place for academia and industry in the fields of applied statistics and analytics. It is a platform for showcasing cutting edge academic research and a resource for industry analysts and businesses interested in applying the latest statistical and analytical too... ]
The Deep Web: Surfacing Hidden Value - White paper on the Deep Web, an area of the Internet 550 times larger than the surface web crawled by traditional search engines.
Meta Description: [ BrightPlanet - automating information from documents ]
UCI Knowledge Discovery in Databases Archive - An online repository of large datasets which encompasses a wide variety of data types, analysis tasks, and application areas. The primary role of this repository is to serve as a benchmark testbed to enable researchers in knowledge discovery and data mining to scale existing and future data analysis algorithms to very large and complex data sets.
and expert-based annotation delivers the optimal platform for knowledge discovery and understanding. ... text mining ...