How does poor data quality affect predictive and generative AI models?
Answer : A
Poor data quality significantly impacts the performance of predictive and generative AI models by leading to inaccurate and unreliable results. Factors such as incomplete data, incorrect data, or poorly formatted data can mislead AI models during the learning phase, causing them to make incorrect assumptions, learn inappropriate patterns, or generalize poorly to new data. This inaccuracy can be detrimental in applications where precision is critical, such as in predictive analytics for sales forecasting or customer behavior analysis. Salesforce emphasizes the importance of data quality for AI model effectiveness in their AI best practices guide, which can be reviewed on Salesforce AI Best Practices.
Cloud Kicks implements a new product recommendation feature for its shoppers that recommends shoes of a given color to display to customers based on the color of the products from their purchase history.
Which type of bias is most likely to be encountered in this scenario?
Answer : A
''Confirmation bias is most likely to be encountered in this scenario. Confirmation bias is a type of bias that occurs when data or information confirms or supports one's existing beliefs or expectations. For example, confirmation bias can occur when a product recommendation feature only recommends shoes of a given color based on the customer's purchase history, without considering other factors or preferences that may influence their choice.''
Which type of bias results from data being labeled according to stereotypes?
Answer : B
''Societal bias results from data being labeled according to stereotypes. Societal bias is a type of bias that reflects the assumptions, norms, or values of a specific society or culture. For example, societal bias can occur when data is labeled based on gender, race, ethnicity, or religion stereotypes.''
What is a potential outcome of using poor-quality data in AI application?
Answer : B
''A potential outcome of using poor-quality data in AI applications is that AI models may produce biased or erroneous results. Poor-quality data means that the data is inaccurate, incomplete, inconsistent, irrelevant, or outdated for the AI task. Poor-quality data can affect the performance and reliability of AI models, as they may not have enough or correct information to learn from or make accurate predictions. Poor-quality data can also introduce or exacerbate biases or errors in AI models, such as human bias, societal bias, confirmation bias, or overfitting or underfitting.''
What are the key components of the data quality standard?
Answer : B
''Accuracy, Completeness, Consistency are the key components of the data quality standard. Data quality standard is a set of criteria or measures that define and evaluate the quality of data for a specific purpose or task. Data quality standard can vary by industry, domain, or application, but some common components are accuracy, completeness, and consistency. Accuracy means that the data values are correct and valid for the data attribute. Completeness means that the data values are not missing any relevant information for the data attribute. Consistency means that the data values are uniform and follow a common standard or format across different records, fields, or sources.''
Cloud Kicks learns of complaints from customers who are receiving too many sales calls and emails.
Which data quality dimension should be assessed to reduce these communication Inefficiencies?
Answer : A
''Duplication is the data quality dimension that should be assessed to reduce communication inefficiencies. Duplication means that the data contains multiple copies or instances of the same record or value. Duplication can cause confusion, errors, or waste in data analysis and processing. For example, duplication can lead to communication inefficiencies if customers receive multiple calls or emails from different sources for the same purpose.''
Cloud kicks wants to decrease the workload for its customer care agents by implementing a chatbot on its website that partially deflects incoming cases by answering frequency asked questions
Which field of AI is most suitable for this scenario?
Answer : A
''Natural language processing is the field of AI that is most suitable for this scenario. Natural language processing (NLP) is a branch of AI that enables computers to understand and generate natural language, such as speech or text. NLP can be used to create conversational interfaces that can interact with users using natural language, such as chatbots. Chatbots can help automate and streamline customer service processes by providing answers, suggestions, or actions based on the user's intent and context.''