This guide explains how to use the AI-powered Data Quality Assessment (DQA) Agent to get insights about your sensor data quality.
The agent is a chat assistant that helps you:
- Ask questions about your sensor data quality
- Get insights and recommendations
- Understand data quality metrics
- Receive explanations of analysis results
- Get help with data quality issues
The DQA Agent is a conversational AI assistant powered by OpenAI that:
- Understands your data quality context
- Accesses your sensor data and metadata
- Provides intelligent responses to questions
- Offers recommendations and insights
- Explains complex concepts in simple terms
The agent can help with:
- Data Quality Questions: Ask about completeness, accuracy, consistency
- Sensor Analysis: Get insights about specific sensors
- Metric Explanations: Understand what quality metrics mean
- Recommendations: Receive suggestions for improving data quality
- Troubleshooting: Get help with data quality issues
To open the DQA Agent:
- Open the application at http://localhost:8000
- Click DQA Agent in the sidebar or on the home page
- The chat interface opens automatically
To select a machine group:
- Choose a machine group from the dropdown
- The selection gives the agent context
- The agent can then access data for the selected machine group
Note: You can chat without selecting a machine group for general questions.
To ask a question:
- Type your question in the message input box
- Press Enter or click Send
- Wait for the agent's response
After the agent responds:
- Read the agent's response
- Ask follow-up questions
- Request clarifications
- Get more details on specific topics
Examples of data quality questions:
- "What is the overall data quality for this machine group?"
- "Which sensors have the most missing values?"
- "Are there any accuracy issues I should be concerned about?"
- "How does data quality compare across different sensors?"
Examples of sensor-specific questions:
- "Tell me about sensor 22PI102"
- "What are the threshold values for pressure sensors?"
- "Which sensors are most correlated?"
- "Are there any sensors with high outlier rates?"
Examples of metric explanation questions:
- "What does completeness percentage mean?"
- "How is accuracy assessed?"
- "What are outliers and why do they matter?"
- "Explain correlation analysis"
Examples of recommendation questions:
- "What should I do about sensors with high missing values?"
- "How can I improve data quality?"
- "Which sensors need maintenance?"
- "What are best practices for data quality?"
Examples of troubleshooting questions:
- "Why is my data quality assessment showing poor results?"
- "What could cause high missing values?"
- "How do I fix accuracy issues?"
- "What should I check if sensors have many alarms?"
Good (specific question):
- "What is the missing value percentage for sensor 22PI102?"
- "Which pressure sensors have accuracy issues?"
Less Effective:
- "Tell me about data"
- "What's wrong?"
Good (question with context):
- "For the KT2201 machine group, which sensors have the most missing values?"
- "In the last month, what is the overall data quality?"
Less Effective:
- "What's the quality?"
- "Tell me about sensors"
Example conversation with follow-up questions:
- User: "What is the data quality for KT2201?"
- Agent: [Provides overview]
- User: "Which specific sensors have issues?"
- Agent: [Lists sensors with problems]
- User: "What should I do about sensor 22PI102?"
The agent understands natural language, so you can:
- Ask questions conversationally
- Use technical terms or plain language
- Request explanations
- Ask for recommendations
Agent responses may include:
- Text Explanations: Detailed answers to your questions
- Lists: Bulleted or numbered lists
- Recommendations: Actionable suggestions
- Code/Data: Technical details when relevant
- Markdown Formatting: Structured information
Direct Answers:
- Straightforward responses to specific questions
- Clear and concise information
Analytical Responses:
- Analysis of your data
- Comparisons and insights
- Pattern identification
Recommendations:
- Actionable suggestions
- Best practices
- Improvement strategies
Explanations:
- Concept explanations
- Metric definitions
- Process descriptions
Data access:
- The agent accesses data from the selected machine group
- It may not have access to all historical data
- It is limited to data stored in the database
Response accuracy:
- Responses are AI-generated and may contain errors
- Verify important findings before acting on them
- Use responses as guidance, not absolute truth
Context understanding:
- The agent keeps track of the conversation context
- It may need clarification for complex questions
- It works best with specific questions
API dependencies:
- A valid OpenAI API key is required
- An internet connection is needed
- Usage limits may apply
Best practices:
- Select a Machine Group: Gives the agent better context
- Be Specific: Ask clear, specific questions
- Use Follow-Ups: Build on previous responses
- Verify Information: Cross-check important findings
- Ask for Examples: Request examples when needed
- Request Clarification: Ask the agent to explain anything that is unclear
User: "What is the overall data quality for KT2201?"
Agent: "Based on the data for KT2201, the overall data quality is good. The completeness is 94.5%, which is above the acceptable threshold. However, there are some sensors with accuracy issues that should be addressed..."
User: "Which sensors have accuracy issues?"
Agent: "The sensors with the most accuracy issues are: 22PI102 (15% alarm rate), 22TI111 (12% alarm rate), and 22PI103 (10% alarm rate). These sensors are exceeding their threshold values frequently..."
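An alarm rate like the ones quoted in this example can be cross-checked by hand. The sketch below assumes that an alarm is any reading outside a sensor's low/high threshold pair. This guide does not define the rule the application uses, so the function `alarm_rate`, the thresholds, and the sample readings are all hypothetical.

```python
def alarm_rate(readings, low, high):
    """Return the percentage of readings outside the range [low, high].

    Assumption: an alarm is any reading below `low` or above `high`.
    The application's actual alarm rule may differ.
    """
    if not readings:
        raise ValueError("readings must not be empty")
    alarms = sum(1 for value in readings if value < low or value > high)
    return 100.0 * alarms / len(readings)


# Hypothetical pressure readings and thresholds, for illustration only.
sample = [4.8, 5.1, 5.0, 7.9, 5.2, 4.9, 5.0, 5.1, 2.1, 5.0]
print(f"Alarm rate: {alarm_rate(sample, low=4.0, high=6.0):.1f}%")
```

Two of the ten sample readings fall outside the range, so the script reports 20.0%. If your own calculation differs noticeably from the agent's figure, check the machine group selection and ask the agent how it derived the number.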
User: "What does completeness percentage mean?"
Agent: "Completeness percentage measures how much of the expected data is actually present. It's calculated as: (Actual Readings / Expected Readings) × 100. For example, if you expect 1000 readings but only have 950, the completeness is 95%..."
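The formula in the agent's answer is simple enough to reproduce when you want to verify a reported figure. The following is a minimal sketch. `completeness_pct` implements the formula as quoted, while `expected_count` is a hypothetical helper that assumes a fixed sampling interval, which may not match how the application derives the expected number of readings.

```python
def completeness_pct(actual_readings, expected_readings):
    """Completeness = (actual readings / expected readings) x 100."""
    if expected_readings <= 0:
        raise ValueError("expected_readings must be positive")
    return 100.0 * actual_readings / expected_readings


def expected_count(duration_s, interval_s):
    """Expected number of readings in a time window.

    Assumption: the sensor reports once every `interval_s` seconds.
    """
    if interval_s <= 0:
        raise ValueError("interval_s must be positive")
    return duration_s // interval_s


# The example from the agent's answer: 950 of 1000 expected readings.
print(f"{completeness_pct(950, 1000):.1f}%")

# Hypothetical case: one day of data at one reading per minute.
expected = expected_count(24 * 3600, 60)
print(f"{completeness_pct(1368, expected):.1f}%")
```

Both calls print 95.0%. The second shows how the expected count might be obtained when only the time window and sampling interval are known.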
User: "How can I improve data quality for my sensors?"
Agent: "Here are some recommendations to improve data quality: 1. Address sensors with high missing values by checking hardware and connections. 2. Calibrate sensors with accuracy issues. 3. Review and adjust threshold settings if needed. 4. Implement regular maintenance schedules..."
Problem: No response from agent
Solutions:
- Check OpenAI API key is configured
- Verify internet connection
- Check backend service is running
- Review error messages
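Some of these checks can be scripted. The sketch below tests whether an API key is present in the environment and whether the application answers at http://localhost:8000. It assumes the key is stored in an environment variable named `OPENAI_API_KEY`, which is the name the OpenAI client libraries read by default. This application may store the key elsewhere, so see the Getting Started Guide for the actual setup. The function names are hypothetical.

```python
import os
import urllib.error
import urllib.request


def api_key_configured(env_var="OPENAI_API_KEY"):
    """Return True if the environment variable holds a non-empty value.

    Assumption: the application reads its key from this variable.
    This does not check that the key is valid.
    """
    return bool(os.environ.get(env_var, "").strip())


def backend_reachable(url="http://localhost:8000", timeout=3.0):
    """Return True if a server at `url` answers an HTTP request at all."""
    try:
        with urllib.request.urlopen(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        # The server answered, even though the status was an error.
        return True
    except (urllib.error.URLError, OSError):
        # Connection refused, DNS failure, timeout, and similar.
        return False


if __name__ == "__main__":
    print("API key configured:", api_key_configured())
    print("Backend reachable:", backend_reachable())
```

A `False` on either line narrows down where to look first. A `True` on both does not prove the key is valid, so error messages in the application still need to be reviewed.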
Problem: Agent provides incorrect information
Solutions:
- Verify data is loaded correctly
- Check machine group selection
- Ask more specific questions
- Cross-check with other analyses
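One way to cross-check an answer such as "which sensors have the most missing values" is to count the missing readings yourself. The sketch below is a rough illustration that assumes you have exported the readings into a dictionary keyed by sensor ID, with missing readings stored as `None` or NaN. The data layout, function names, and sample values are hypothetical.

```python
import math


def missing_pct_by_sensor(readings_by_sensor):
    """Map each sensor to its percentage of missing readings.

    Assumption: a reading is missing when it is None or NaN.
    Sensors with no readings at all are skipped.
    """
    result = {}
    for sensor, values in readings_by_sensor.items():
        if not values:
            continue
        missing = sum(
            1 for v in values
            if v is None or (isinstance(v, float) and math.isnan(v))
        )
        result[sensor] = 100.0 * missing / len(values)
    return result


def rank_by_missing(readings_by_sensor):
    """Return (sensor, missing %) pairs, worst sensor first."""
    pct = missing_pct_by_sensor(readings_by_sensor)
    return sorted(pct.items(), key=lambda item: item[1], reverse=True)


# Hypothetical readings, for illustration only.
data = {
    "22PI102": [5.0, None, 5.1, None],
    "22TI111": [80.2, 80.4, float("nan"), 80.1],
    "22PI103": [4.9, 5.0, 5.0, 5.1],
}
for sensor, pct in rank_by_missing(data):
    print(f"{sensor}: {pct:.1f}% missing")
```

If the ranking you compute disagrees with the agent's, confirm that the same machine group and time range are being compared before concluding that the response is wrong.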
Problem: Agent takes too long to respond
Solutions:
- Allow extra time, since responses can take a while to generate
- Check network connection
- Verify backend performance
- Reduce question complexity
After using the DQA Agent:
- Verify Findings: Cross-check with other analyses
- Take Action: Implement recommendations
- Re-assess: Check improvements after changes
- Explore More: Use other analysis features
Related guides:
- Data Quality Guide - Comprehensive quality assessment
- Missing Values Guide - Missing data analysis
- Invalid Values Guide - Invalid readings analysis
Note: The DQA Agent requires a valid OpenAI API key. See Getting Started Guide for setup instructions.