How To Prompt ChatGPT To Create a Data Cleaning Guide

Getting your data clean and ready for analysis can be a real headache, especially when you're not sure where to start. This ChatGPT prompt generates a personalized guide that walks through the entire data cleaning process, from handling missing values to dealing with outliers. It first has ChatGPT ask about your data and your needs, so the advice you get matches your situation.
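To give a sense of the steps such a guide typically covers, here is a minimal Python sketch that uses only the standard library. It is an illustration, not output from the prompt: the `clean_records` function, the sample rows, and the choice of median imputation and the 1.5 × IQR outlier rule are all assumptions made for this example.

```python
from statistics import median, quantiles


def clean_records(records, field):
    """Deduplicate rows, fill missing values with the median, flag IQR outliers."""
    # 1. Drop exact duplicate rows while preserving the original order.
    seen = set()
    unique = []
    for row in records:
        key = tuple(sorted(row.items()))
        if key not in seen:
            seen.add(key)
            unique.append(dict(row))

    # 2. Fill missing values with the median of the observed values.
    observed = [row[field] for row in unique if row.get(field) is not None]
    fill = median(observed)
    for row in unique:
        if row.get(field) is None:
            row[field] = fill

    # 3. Flag outliers with the 1.5 * IQR rule, based on observed values only.
    q1, _, q3 = quantiles(observed, n=4, method="inclusive")
    iqr = q3 - q1
    low, high = q1 - 1.5 * iqr, q3 + 1.5 * iqr
    for row in unique:
        row["outlier"] = not (low <= row[field] <= high)
    return unique


# Hypothetical sample data: one duplicate row, one missing value, one extreme value.
raw = [
    {"id": 1, "amount": 10.0},
    {"id": 1, "amount": 10.0},
    {"id": 2, "amount": None},
    {"id": 3, "amount": 12.0},
    {"id": 4, "amount": 11.0},
    {"id": 5, "amount": 13.0},
    {"id": 6, "amount": 500.0},
]
cleaned = clean_records(raw, "amount")
```

In practice the right choices depend on your data. Median imputation and the IQR rule are common defaults, but a guide tailored by the prompt below may recommend different techniques or a library such as pandas.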

Prompt
You will act as an expert data scientist to help me understand and master the key steps in data cleaning. Provide a detailed, step-by-step guide that covers the essential processes involved in preparing raw data for analysis. Ensure the explanation is clear, practical, and actionable, with examples where applicable. Use my communication style, which is concise, professional, and approachable, to write the output. Additionally, include best practices, common pitfalls to avoid, and tools or techniques that can streamline the data cleaning process.

**In order to get the best possible response, please ask me the following questions:**
1. What type of data are you working with (e.g., structured, unstructured, numerical, categorical)?
2. Are there specific challenges or issues you frequently encounter in your data (e.g., missing values, duplicates, outliers)?
3. Do you have a preferred programming language or tool for data cleaning (e.g., Python, R, Excel)?
4. Should the guide include advanced techniques, or is it intended for beginners?
5. Are there any industry-specific considerations I should account for (e.g., healthcare, finance, retail)?
6. Would you like examples or case studies to illustrate the steps?
7. Should I focus on automated tools, manual processes, or a combination of both?
8. Do you need recommendations for software or libraries to use for data cleaning?
9. Should I include a checklist or summary for quick reference?
10. Are there any specific formatting or organizational preferences for the output?