122. Describe each of the four V's of data analytics: variety, volume, velocity, and veracity.
Dirty data are of such poor quality that they lack integrity and cannot be trusted. Too often, managers and information workers are constrained by data that cannot be trusted because they are incomplete, out of context, outdated, inaccurate, inaccessible, or so overwhelming that they require weeks to analyze. In those situations, the decision maker faces too much uncertainty to make intelligent business decisions. This can result in lost business and in costs related to correcting and/or preventing errors. The cost of poor-quality data is expressed by the following formula.
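The formula itself is not reproduced in this answer. As a rough sketch, assuming it simply adds up the three cost components named in the preceding sentence (lost business, the cost of preventing errors, and the cost of correcting errors), the calculation could look like the following. The function name and parameter names are hypothetical and chosen only for illustration.

```python
def cost_of_poor_quality_data(lost_business, cost_to_prevent_errors, cost_to_correct_errors):
    """Sum the three cost components named in the text.

    Assumption: the total cost is a plain sum of the three inputs,
    all expressed in the same currency unit.
    """
    return lost_business + cost_to_prevent_errors + cost_to_correct_errors


# Illustrative figures only; they are not taken from the source.
total = cost_of_poor_quality_data(
    lost_business=100_000,
    cost_to_prevent_errors=20_000,
    cost_to_correct_errors=30_000,
)
print(total)
```

Under this assumption, reducing any one component (for example, spending less on correction because fewer errors reach production) lowers the total directly.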
Data mining is important because any customer can become a brand advocate or adversary by freely expressing opinions and attitudes that reach millions of other customers on social media.
An organization that contracts a vendor to provide IaaS on a public cloud is responsible for maintaining the operating system used by the virtual machines hosted in the IaaS environment.
Because the organization already has a virtualized datacenter, it can likely use existing resources to deliver the application without incurring the extra expense of subscribing to a cloud solution.