what are data warehousing and data mining? how do businesses use these tools?course hero

by Amani Pfannerstill 3 min read

Both data mining and data warehousing are business intelligence tools that are used to turn information (or data) into actionable knowledge. The important distinctions between the two tools are the methods and processes each uses to achieve this goal. Data mining is a process of statistical analysis.

Full Answer

What is the difference between data mining and data warehousing?

Data mining is carried by business users with the help of engineers. Data warehousing is the process of pooling all relevant data together. Data mining is considered as a process of extracting data from large data sets. Attention reader!

What is data warehousing and why is it important?

Data warehousing is a collection of tools and techniques using which more knowledge can be driven out from a large amount of data. This helps with the decision-making process and improving information resources.

What is data warehouse and query tools?

Data flows into a data warehouse from the various databases. A data warehouse works by organizing data into a schema which describes the layout and type of data. Query tools analyze the data tables using schema. It is the process of finding patterns and correlations within large data sets to identify relationships between data.

What are the subjects in data warehouse management?

These subjects can be a product, customers, suppliers, sales, revenue, etc. A data warehouse focuses on modeling and analysis of data for decision making. Integrated: A data warehouse is constructed by combining data from heterogeneous sources such as relational databases, flat files, etc.

What is the difference between data mining and data warehousing?

The main difference between data warehousing and data mining is that data warehousing is the process of compiling and organizing data into one common database, whereas data mining is the process of extracting meaningful data from that database. Data mining can only be done once data warehousing is complete. Data warehouse is the repository ...

What is data mining?

Data Mining is used to extract useful information and patterns from data. The data mining can be carried with any traditional database, but since a data warehouse contains quality data, it is good to have data mining over the data warehouse system. Data Mining supports knowledge discovery by finding hidden patterns and associations, ...

What is data warehouse?

A data warehouse is a technique of organizing data so that there should be corporate credibility and integrity, but , Data mining is helpful in extracting meaningful patterns those are not found , necessarily by only processing data or querying data in the data warehouse.

Why is data mining important?

It provides the organization a mechanism to store huge amount of data. Data mining techniques are applied on data warehouse in order to discover useful patterns. This process must take place before data mining process because it compiles and organizes data into a common database.

Why is trend analysis important?

Trend analysis: Understanding trends in the marketplace is a strategic advantage because it helps reduce costs and timeliness to market. Fraud detection: Data mining techniques can help discover which insurance claims, cellular phone calls or credit card purchases are likely to be fraudulent.

What is data mining used for?

These tools are much more than basic summaries or queries and use much more complicated algorithms. When data mining is used in business applications , it is also referred to as business analytics or business intelligence. Consider an online retailer that sells a wide variety of products.

What is data warehouse?

A data warehouse is a collection of databases that work together. A data warehouse makes it possible to integrate data from multiple databases, which can give new insights into the data. The ultimate goal of a database is not just to store data, but to help businesses make decisions based on that data. A data warehouse supports this goal by ...

What is distributed database?

Distributed databases are used to store a database at multiple computer sites to improve data access and processing. Data mining is the process of analyzing data and summarizing it to produce useful information. Data mining uses sophisticated data analysis tools to discover patterns and relationships in large datasets.

What are the characteristics of a distributed database?

To a typical user, the distributed database appears as a centralized database. Behind the scenes, however, parts of that database are located in different places. The typical characteristics of a distributed database management system, or DBMS, are: 1 Multiple computer network sites are connected by a communication system 2 Data at any site are available to users at other sites 3 Data at each site are under control of the DBMS

What is a database in a computer?

A database consists of one or more files that need to be stored on a computer. In large organizations, databases are typically not stored on the individual computers of employees but in a central system. This central system typically consists of one or more computer servers. A server is a computer system that provides a service over a network.

What are the advantages and disadvantages of a distributed database?

The major disadvantage is that the database is much more complex to manage. Setting up a distributed database is typically the task of a database administrator with very specialized database skills.

Where are databases stored?

In summary, databases are often stored in a central computer system known as a computer server. A data warehouse is a collection of databases that work together. This makes it possible to examine patterns and trends by combining multiple databases. Distributed databases are used to store a database at multiple computer sites to improve data access ...

Why is it important to analyze data mining?

Analyzing Data Mining to Data Warehousing subset is a very important step because removing unrelated data elements will reduce the search space during the Data Mining phase.

What is the process of KDD?

Data mining is one of the more crucial steps in the process of KDD. KDD basically covers everything from the selection of data to finally evaluating the mined data. The complete KDD cycle is shown in the image below:

What is a data warehouse?

Data Warehouses are responsible for storing historical data for organizations. For example, a transaction system can hold the most recent address of a customer, but a Data Warehouse will hold all the previous addresses too. It continuously keeps adding data from various sources, apart from keeping the historical data – that’s what makes it a time-variant model. The data stored will always vary with time.

What is ETL process?

Take a look at the above image. The data that is collected from various sources (operational system, ERP, CRM, Flat Files, etc.) is made to undergo an ETL process before it’s inserted into the data warehouse. This is essentially done to remove anomalies, if any, from the data – so that no harm is caused to the Data Warehouse. ETL stands for – Extraction, Transformation, and Loading. Let’s have a look at each of these processes in detail. To understand better, we’ll use an analogy – think of a gold rush and read on!

How are data mining and data warehousing related?

Are data mining and data warehousing related? Both data mining and data warehousing are business intelligence tools that are used to turn information (or data) into actionable knowledge. The important distinctions between the two tools are the methods and processes each uses to achieve this goal. Data mining is a process of statistical analysis.

What is the relationship between data mining and data warehousing?

So the crux of the relationship between data mining and data warehousing is that data, properly warehoused, is easier to mine.

How is data warehouse related to each other?

Data warehouse experts consider that the various stores of data are connected and related to each other conceptually as well as physically. A business's data is usually stored across a number of databases. However, to be able to analyze the broadest range of data, each of these databases needs to be connected in some way.

What is the difference between data mining and statistical analysis?

The important distinctions between the two tools are the methods and processes each uses to achieve this goal. Data mining is a process of statistical analysis. Analysts use technical tools to query and sort through terabytes of data looking for patterns.

What is the difference between data mining and data warehouse?

Data warehouse refers to the process of compiling and organizing data into one common database , whereas data mining refers to the process of extracting useful data from the databases. The data mining process depends on the data compiled in the data warehousing phase to recognize meaningful patterns. A data warehousing is created to support management systems.

Why is data warehouse important?

One of the advantages of the data warehouse is its ability to update frequently. That is the reason why it is ideal for business entrepreneurs who want up to date with the latest stuff. The data mining techniques are cost-efficient as compared to other statistical data applications.

How does data mining help?

Data Mining can predict the market that helps the business to make the decision. For example, it predicts who is keen to purchase what type of products. ii. Fraud detection: Data Mining methods can help to find which cellular phone calls, insurance claims, credit, or debit card purchases are going to be fraudulent.

What is data warehouse?

A Data Warehouse refers to a place where data can be stored for useful mining. It is like a quick computer system with exceptionally huge data storage capacity. Data from the various organization's systems are copied to the Warehouse, where it can be fetched and conformed to delete errors.

What are the features of a data warehouse?

The Important features of Data Warehouse are given below: 1. Subject Oriented. A data warehouse is subject-oriented. It provides useful data about a subject instead of the company's ongoing operations, and these subjects can be customers, suppliers, marketing, product, promotion, etc.

What is data mining?

Data mining is primarily used to discover and indicate relationships among the data sets. Data mining aims to enable business organizations to view business behaviors, trends relationships that allow the business to make data-driven decisions. It is also known as knowledge Discover in Database (KDD).

Is data mining accurate?

The data mining techniques are not 100 percent accurate. It may lead to serious consequences in a certain condition. In the data warehouse, there is a high possibility that the data required for analysis by the company may not be integrated into the warehouse. It can simply lead to loss of data.

What is the difference between a data warehouse and a data mining?

Data mining is considered as a process of extracting data from large data sets, whereas a Data warehouse is the process of pooling all the relevant data together. Data mining is the process of analyzing unknown patterns of data, whereas a Data warehouse is a technique for collecting and managing data. Data mining is usually done by business users ...

What is data warehousing?

Data warehousing is a process which needs to occur before any data mining can take place. Data mining is the considered as a process of extracting data from large data sets. On the other hand, Data warehousing is the process of pooling all relevant data together. One of the most important benefits of data mining techniques is ...

Why do we need data mining?

Some most important reasons for using Data mining are: 1 Establish relevance and relationships amongst data. Use this information to generate profitable insights 2 Business can mak informed decisions quickly 3 Helps to find out unusual shopping patterns in grocery stores. 4 Optimize website business by providing customize offers to each visitor. 5 Helps to measure customer's response rates in business marketing. 6 Creating and maintaining new customer groups for marketing purposes. 7 Predict customer defections, like which customers are more likely to switch to another supplier in the nearest future. 8 Differentiate between profitable and unprofitable customers. 9 Identify all kind of suspicious behavior, as part of a fraud detection process.

Is data mining accurate?

The Data mining techniques are never 100% accurate and may cause serious consequences in certain conditions. In the data warehouse, there is great chance that the data which was required for analysis by the organization may not be integrated into the warehouse. It can easily lead to loss of information.

Can data mining be misused?

It can easily lead to loss of information. The information gathered based on Data Mining by organizations can be misused against a group of people. Data warehouses are created for a huge IT project. Therefore, it involves high maintenance system which can impact the revenue of medium to small-scale organizations.

How is data mining used?

Data mining can be used by corporations for everything from learning about what customers are interested in or want to buy to fraud detection and spam filtering. Data mining programs break down patterns and connections in data based on what information users request or provide.

What is data mining?

Data mining involves exploring and analyzing large blocks of information to glean meaningful patterns and trends. It can be used in a variety of ways, such as database marketing, credit risk management, fraud detection, spam Email filtering, or even to discern the sentiment or opinion of users. The data mining process breaks down into five steps.

How do supermarkets use data mining?

Grocery stores are well-known users of data mining techniques. Many supermarkets offer free loyalty cards to customers that give them access to reduced prices not available to non-members. The cards make it easy for stores to track who is buying what, when they are buying it and at what price. After analyzing the data, stores can then use this data to offer customers coupons targeted to their buying habits and decide when to put items on sale or when to sell them at full price.