Unit 2: Data Literacy
ASSIGNMENT
1. Introduction to Data Literacy (Q1-15)
Data literacy refers to:
a) The ability to read and write data
b) The ability to read, understand, create, and communicate data as information
c) The ability to store data only
d) None of the above
Answer: bWhich of the following is a key skill of a data-literate person?
a) Ignoring statistics
b) Understanding data visualisations
c) Avoiding graphs and charts
d) Relying only on intuition
Answer: bData literacy helps in:
a) Making informed decisions
b) Reducing the need for analysis
c) Avoiding technology
d) All of the above
Answer: aWhich is NOT part of data literacy?
a) Interpreting graphs
b) Collecting data
c) Ignoring data sources
d) Evaluating data quality
Answer: cThe foundation of data literacy lies in:
a) Guessing trends
b) Understanding data types and formats
c) Writing computer programs only
d) Ignoring context
Answer: bIn AI, data literacy is important because:
a) AI cannot function without human input
b) Data drives AI model performance
c) It helps in designing experiments
d) All of the above
Answer: dData literacy can be improved by:
a) Analysing real datasets
b) Practising data visualisation
c) Learning statistics
d) All of the above
Answer: dWhich of the following statements is true?
a) Data literacy is only for data scientists
b) Data literacy is a skill for everyone in the digital age
c) Data literacy is not useful for decision-making
d) Data literacy is the same as computer literacy
Answer: bA data-literate student can:
a) Read graphs correctly
b) Misinterpret numbers
c) Ignore context of data
d) Avoid using data in reports
Answer: aData literacy improves:
a) Critical thinking
b) Communication skills
c) Decision-making
d) All of the above
Answer: dWhich is an example of data literacy in everyday life?
a) Choosing a product based on customer reviews and ratings
b) Buying the cheapest product without comparison
c) Relying on rumours
d) None of these
Answer: aData literacy requires both:
a) Reading and writing
b) Technical skills and analytical thinking
c) Painting and drawing
d) Guessing and predicting
Answer: bWhich skill is NOT part of data literacy?
a) Understanding charts
b) Data cleaning
c) Avoiding data visualisation
d) Recognising bias
Answer: cThe opposite of data literacy is:
a) Data fluency
b) Data illiteracy
c) Data science
d) Data analytics
Answer: bData literacy is essential in AI because:
a) AI models work only on clean, relevant data
b) AI ignores data quality
c) AI creates data literacy automatically
d) AI does not use data
Answer: a
2. Impact of Data Literacy (Q16–30)
Data literacy helps organisations:
a) Make better decisions
b) Reduce guesswork
c) Improve efficiency
d) All of the above
Answer: dWhich is NOT a benefit of data literacy?
a) Informed decision-making
b) Increased misinformation
c) Improved communication
d) Better productivity
Answer: bData literacy impacts personal life by:
a) Helping in understanding news statistics
b) Ignoring data-driven facts
c) Encouraging rumours
d) Avoiding analysis
Answer: aIn AI projects, data literacy:
a) Ensures data is used ethically
b) Leads to biased models
c) Eliminates the need for human judgment
d) Has no role
Answer: aWhich industry benefits from data literacy?
a) Healthcare
b) Education
c) Business
d) All of the above
Answer: dLack of data literacy may lead to:
a) Better decision-making
b) Data misinterpretation
c) More accurate predictions
d) Improved analysis
Answer: bData literacy helps reduce:
a) Information overload
b) Data-based errors
c) Both a and b
d) None
Answer: cGovernments use data literacy to:
a) Frame public policies
b) Spread misinformation
c) Ignore trends
d) Avoid surveys
Answer: aIn the workplace, data literacy:
a) Improves collaboration
b) Helps in clear reporting
c) Avoids bias in presentations
d) All of the above
Answer: dA data-literate society can:
a) Identify fake news
b) Make data-driven debates
c) Reduce bias
d) All of the above
Answer: dWhich statement is true?
a) Data literacy is only useful for IT jobs
b) Data literacy benefits all careers
c) Data literacy is not linked to technology
d) Data literacy reduces transparency
Answer: bWhich is an economic benefit of data literacy?
a) Higher productivity
b) Poor forecasting
c) Increased errors
d) Ignoring trends
Answer: aThe impact of data literacy in AI includes:
a) Better model accuracy
b) Bias reduction
c) Ethical AI use
d) All of the above
Answer: dData literacy supports innovation by:
a) Identifying opportunities from data
b) Reducing experiments
c) Ignoring feedback
d) None of these
Answer: aWhich of these is NOT an impact of data literacy?
a) Increased misinformation
b) Better insights
c) Enhanced problem-solving
d) More effective communication
Answer: a
3. Data Privacy and Data Security (Q31–50)
Data privacy refers to:
a) Protecting data from loss
b) Controlling who can access personal information
c) Backing up data
d) Sharing all personal data online
Answer: bData security means:
a) Protecting data from unauthorised access
b) Making all data public
c) Avoiding encryption
d) Ignoring threats
Answer: aWhich is an example of a privacy measure?
a) Using strong passwords
b) Leaving accounts logged in
c) Sharing login details
d) None
Answer: aWhich is an example of a data security threat?
a) Malware
b) Encryption
c) Firewall
d) Anti-virus software
Answer: aWhich law protects personal data in India?
a) RTI Act
b) IT Act, 2000
c) Consumer Protection Act
d) None of the above
Answer: bWhich is NOT a method to ensure data security?
a) Encryption
b) Regular backups
c) Weak passwords
d) Two-factor authentication
Answer: cData breaches occur when:
a) Hackers gain access to sensitive data
b) Data is encrypted
c) Systems are updated
d) Backups are taken
Answer: aWhich is an ethical practice in data privacy?
a) Asking user consent before collecting data
b) Selling personal data without permission
c) Ignoring data laws
d) Sharing passwords
Answer: aWhich of these is NOT related to data security?
a) Antivirus software
b) Firewalls
c) Leaving data unprotected
d) User authentication
Answer: cThe main purpose of data privacy is to:
a) Hide information from everyone
b) Allow only authorised access to personal data
c) Make data public
d) None
Answer: bWhich is an example of sensitive personal data?
a) Age group
b) Credit card number
c) Favourite colour
d) City name
Answer: bCybersecurity is directly linked to:
a) Data privacy
b) Data security
c) Both a and b
d) None
Answer: cWhich is a measure to improve data privacy?
a) Using secure websites (HTTPS)
b) Using public Wi-Fi for banking
c) Avoiding passwords
d) Sharing all details
Answer: aWhich of these attacks steals confidential information?
a) Phishing
b) CAPTCHA
c) VPN
d) Firewall
Answer: aWhy is data security important in AI?
a) To prevent model hacking
b) To ensure training data is not tampered with
c) To protect sensitive datasets
d) All of the above
Answer: dWhich is NOT a good privacy practice?
a) Reading privacy policies
b) Using weak passwords
c) Limiting app permissions
d) Avoiding sharing personal details online
Answer: bWhich is a legal requirement for companies handling personal data?
a) Transparency in data collection
b) Selling data
c) Ignoring user consent
d) None
Answer: aWhich term means hiding sensitive data in storage?
a) Decryption
b) Encryption
c) Formatting
d) Scanning
Answer: bIn data security, MFA stands for:
a) Multi-Factor Authentication
b) Multi-File Access
c) Multiple File Authentication
d) None
Answer: aWhich of these can compromise data privacy?
a) Clicking on suspicious links
b) Using secure Wi-Fi
c) Enabling two-step verification
d) Using strong passwords
Answer: a
4. Types of Data (Q51–65)
Which is NOT a type of data?
a) Structured
b) Unstructured
c) Semi-structured
d) Random data
Answer: dStructured data is:
a) Organised and stored in rows & columns
b) Without any fixed format
c) Multimedia files only
d) None of the above
Answer: aUnstructured data includes:
a) Images
b) Videos
c) Text documents without structure
d) All of the above
Answer: dSemi-structured data:
a) Has partial organisation with tags/markers
b) Is always in databases
c) Has no organisation at all
d) None of these
Answer: aJSON and XML are examples of:
a) Structured data
b) Semi-structured data
c) Unstructured data
d) Random data
Answer: bWhich data type is easiest to analyse?
a) Structured
b) Unstructured
c) Semi-structured
d) Complex data
Answer: aWhich is NOT an example of structured data?
a) Excel sheet
b) SQL database table
c) Scanned handwritten notes
d) Employee database
Answer: cText messages can be considered:
a) Structured
b) Unstructured
c) Semi-structured
d) Random
Answer: bAudio files are usually:
a) Structured
b) Unstructured
c) Semi-structured
d) None
Answer: bMetadata is:
a) Data about data
b) Large raw data
c) Encrypted information
d) None
Answer: aWhich is NOT a characteristic of unstructured data?
a) Hard to analyse
b) Fixed schema
c) May include videos/images
d) Often requires special tools to process
Answer: bSurvey responses stored in a spreadsheet are:
a) Structured data
b) Unstructured data
c) Semi-structured data
d) Complex data
Answer: aLogs from a website server are:
a) Structured
b) Unstructured
c) Semi-structured
d) Random
Answer: cWhich data type requires the most preprocessing for AI?
a) Structured
b) Unstructured
c) Semi-structured
d) None
Answer: bBig Data often contains:
a) Only structured data
b) Only unstructured data
c) All types of data
d) None
Answer: c
5. Data Acquisition (Q66–80)
Data acquisition means:
a) Buying data
b) Collecting data from various sources
c) Deleting old data
d) Encrypting data
Answer: bWhich is NOT a method of data acquisition?
a) Surveys
b) Web scraping
c) Data deletion
d) Sensor readings
Answer: cPrimary data is collected:
a) First-hand by the researcher
b) From existing reports
c) From online articles
d) None
Answer: aSecondary data comes from:
a) Interviews
b) Books and journals
c) Field surveys
d) Observation
Answer: bWhich is an automated data acquisition method?
a) Manual entry
b) API data fetching
c) Face-to-face interview
d) None
Answer: bIoT devices mainly collect data via:
a) Manual logging
b) Sensors
c) Telephone surveys
d) None
Answer: bWhich is an example of crowd-sourced data?
a) Wikipedia edits
b) Newspaper articles
c) Census reports
d) Government databases
Answer: aWeb scraping collects:
a) Offline data
b) Data from websites automatically
c) Random noise
d) None
Answer: bWhich factor is NOT important in data acquisition?
a) Accuracy
b) Relevance
c) Bias
d) Unreliability
Answer: dThe main challenge in data acquisition is:
a) Finding relevant data
b) Getting useless data
c) Ignoring data sources
d) None
Answer: aWhich of these is NOT a source of primary data?
a) Experiments
b) Questionnaires
c) Published reports
d) Direct observation
Answer: cSocial media platforms provide:
a) Structured only
b) Unstructured only
c) Both structured and unstructured data
d) None
Answer: cWhich is NOT a tool for automated data collection?
a) Google Forms
b) Python scripts
c) Pen and paper survey
d) API integration
Answer: cWhich is a risk in online data acquisition?
a) Data privacy issues
b) Faster collection
c) Reduced costs
d) Automation
Answer: aWhich technology helps in real-time data acquisition?
a) Sensors and IoT
b) Paper forms
c) Offline notebooks
d) None
Answer: a
6. Various Sources of Data (Q81–95)
Which is a government data source?
a) Census report
b) Blogs
c) Social media posts
d) Personal diaries
Answer: aWhich is an example of open data?
a) Data.gov.in datasets
b) Private company sales records
c) Confidential medical files
d) None
Answer: aWhich is NOT a public data source?
a) Wikipedia
b) Paid market research report
c) Government websites
d) Public domain books
Answer: bCommercial data sources are:
a) Paid databases
b) Public data
c) Illegal data
d) None
Answer: aWhich is an example of real-time data source?
a) Weather sensors
b) Annual reports
c) Archived news
d) Old census
Answer: aWhich data source is often most reliable?
a) Peer-reviewed journals
b) Random blogs
c) Unverified posts
d) Rumours
Answer: aWhich is NOT a type of organisational data source?
a) Internal sales reports
b) Employee records
c) Social media rumours
d) Customer databases
Answer: cSensor-based data is usually:
a) Real-time
b) Delayed
c) Historical
d) None
Answer: aWhich is an example of a secondary data source?
a) Government reports
b) Conducting your own survey
c) Taking measurements in the lab
d) Interviewing people
Answer: aSocial media data is often:
a) Structured
b) Unstructured
c) Semi-structured
d) All of the above
Answer: dWhich source provides satellite imagery data?
a) ISRO
b) Newspaper
c) School textbook
d) None
Answer: aWhich source provides business transaction data?
a) POS (Point of Sale) systems
b) Social media feeds
c) Weather station
d) None
Answer: aOnline review websites provide:
a) Only structured data
b) Only unstructured data
c) Both structured & unstructured data
d) None
Answer: cWhich is NOT a limitation of free data sources?
a) Limited accuracy
b) Limited coverage
c) Always up-to-date
d) Lack of depth
Answer: cProprietary data is:
a) Owned and controlled by an organisation
b) Public domain
c) Government open data
d) Free for all
Answer: a
7. Data Preprocessing (Q96–110)
Data preprocessing is:
a) Cleaning and preparing data for analysis
b) Deleting all data
c) Randomising data
d) None
Answer: aWhich is NOT a step in data preprocessing?
a) Data cleaning
b) Data integration
c) Data corruption
d) Data transformation
Answer: cRemoving duplicate entries is part of:
a) Data cleaning
b) Data modelling
c) Data storage
d) None
Answer: aNormalisation means:
a) Scaling data to a standard range
b) Deleting records
c) Storing data in databases
d) None
Answer: aHandling missing values is done in:
a) Data cleaning
b) Data interpretation
c) Data collection
d) None
Answer: a
Features of Data
111. Which of the following is a key feature of good data?
a) Irrelevance
b) Accuracy
c) Guesswork
d) Randomness
Answer: b
112. Data should be collected from:
a) Reliable sources
b) Any source, even if false
c) Rumors only
d) Fictional stories
Answer: a
113. Timeliness in data means:
a) Data is always old
b) Data is available when needed
c) Data is outdated
d) Data is irrelevant
Answer: b
114. Which feature ensures that data represents the true scenario?
a) Accuracy
b) Irrelevance
c) Ambiguity
d) Incompleteness
Answer: a
115. Completeness in data means:
a) Data contains all required values
b) Data is missing important details
c) Data has only half the values
d) Data is confidential
Answer: a
116. Which is NOT a feature of quality data?
a) Reliability
b) Consistency
c) Randomness
d) Validity
Answer: c
117. Valid data is:
a) Collected according to rules or formats
b) Collected randomly without rules
c) Always fake
d) Not related to the problem
Answer: a
118. Which feature of data avoids duplication?
a) Consistency
b) Uniqueness
c) Validity
d) Completeness
Answer: b
119. Accuracy in data can be affected by:
a) Human errors
b) Faulty sensors
c) Misreporting
d) All of the above
Answer: d
120. Up-to-date data is called:
a) Timely data
b) Historical data
c) Incomplete data
d) Fake data
Answer: a
Q121–Q135: Data Processing and Data Interpretation
121. Data processing means:
a) Storing data only
b) Converting raw data into meaningful information
c) Deleting unnecessary data only
d) Guessing data meaning
Answer: b
122. Which is the first step in data processing?
a) Data storage
b) Data collection
c) Data presentation
d) Data deletion
Answer: b
123. Arranging data in tables or charts is called:
a) Data collection
b) Data organization
c) Data destruction
d) Data hiding
Answer: b
124. Which step comes after data analysis?
a) Data cleaning
b) Data interpretation
c) Data collection
d) Data deletion
Answer: b
125. Data interpretation means:
a) Explaining the meaning of analyzed data
b) Storing raw data
c) Destroying old data
d) Creating fake reports
Answer: a
126. In AI, data interpretation helps in:
a) Model training
b) Decision-making
c) Finding patterns
d) All of the above
Answer: d
127. Which tool is used for automated data processing?
a) Spreadsheet software
b) Calculator
c) Whiteboard
d) Pen and paper
Answer: a
128. Which step ensures that data is correct before interpretation?
a) Data cleaning
b) Data ignoring
c) Data deletion
d) Data duplication
Answer: a
129. Data processing can be:
a) Manual
b) Electronic
c) Both a and b
d) None of these
Answer: c
130. The outcome of data interpretation should be:
a) Confusing
b) Actionable insights
c) Irrelevant facts
d) Outdated reports
Answer: b
131. The transformation of raw data into readable form is called:
a) Data encoding
b) Data processing
c) Data mining
d) Data deletion
Answer: b
132. Which step in data processing ensures removal of incorrect entries?
a) Data cleaning
b) Data interpretation
c) Data collection
d) Data duplication
Answer: a
133. Data interpretation often uses:
a) Charts
b) Graphs
c) Tables
d) All of the above
Answer: d
134. Which is a benefit of proper data processing?
a) Faster decision-making
b) Better accuracy
c) Easy communication
d) All of the above
Answer: d
135. Interpreted data helps organizations:
a) Plan strategies
b) Ignore facts
c) Create confusion
d) Avoid action
Answer: a
Q136–Q150: Data Visualisation using Tableau
136. Tableau is a:
a) Data visualization tool
b) Data collection tool only
c) Programming language
d) Word processing software
Answer: a
137. Which of these is NOT a feature of Tableau?
a) Drag-and-drop interface
b) Real-time data analysis
c) Complex coding requirement
d) Interactive dashboards
Answer: c
138. Tableau can connect to:
a) Excel files
b) Databases
c) Cloud services
d) All of the above
Answer: d
139. Which chart type is available in Tableau?
a) Bar chart
b) Line chart
c) Pie chart
d) All of the above
Answer: d
140. Tableau helps in:
a) Presenting data visually
b) Making raw data unreadable
c) Destroying data
d) Only storing data
Answer: a
141. Tableau dashboards are:
a) Static
b) Interactive
c) Boring
d) Only text-based
Answer: b
142. Which is an advantage of Tableau?
a) Complex coding skills required
b) Quick insights through visuals
c) Limited data source connections
d) Text-only reports
Answer: b
143. In Tableau, filters are used to:
a) Remove unnecessary visualizations
b) Focus on specific data values
c) Delete old projects
d) Lock charts
Answer: b
144. Tableau worksheets contain:
a) One visualization per sheet
b) Only text data
c) Only raw numbers
d) No visual elements
Answer: a
145. Which file format can Tableau read?
a) .xlsx
b) .csv
c) .txt
d) All of the above
Answer: d
146. Story feature in Tableau is used for:
a) Sequential presentation of visuals
b) Writing paragraphs
c) Saving data only
d) Data deletion
Answer: a
147. Tableau Public is:
a) Free version of Tableau
b) Paid only software
c) Programming tool
d) Spreadsheet software
Answer: a
148. Which step comes first when creating a visualization in Tableau?
a) Connect to data source
b) Publish dashboard
c) Share report
d) Add filters
Answer: a
149. Which type of chart is best for showing trends over time in Tableau?
a) Line chart
b) Pie chart
c) Donut chart
d) Scatter plot
Answer: a
150. Tableau visualization helps in:
a) Quick understanding of data patterns
b) Making data harder to read
c) Deleting important data
d) Avoiding data interpretation
Answer: a