1. Data munging is ___
View Answer
A process to clean messy data
2. Can a technically correct dataset still be incorrect for data analysis?
a) YES
b) NO
View Answer
YES
8. Binning is a method to manage ___
View Answer
Noisy data
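As a rough illustration of binning, the sketch below smooths values by replacing each one with the mean of its bin. The sample vector `x` and the choice of three equal-frequency bins are assumptions made for this example, not part of the quiz.

```r
# Hypothetical noisy measurements (illustrative values only)
x <- c(4, 8, 15, 21, 21, 24, 25, 28, 34)

# Sort the values and assign them to 3 equal-frequency bins of 3 values each
sorted_x <- sort(x)
bins <- rep(1:3, each = 3)

# Smooth by bin means: every value is replaced by the mean of its bin
smoothed <- ave(sorted_x, bins, FUN = mean)
print(smoothed)
```

Other smoothing rules, such as bin medians or bin boundaries, follow the same pattern with a different `FUN`.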
4. Data cleaning is the most time-consuming process in data analysis
a) True
b) False
View Answer
True
5. The tail() function shows ___ by default
View Answer
The last 6 rows
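A quick way to confirm the default, using the built-in `cars` dataset as an example (the variable name `last_rows` is just for illustration):

```r
# tail() returns the last n rows; n defaults to 6
last_rows <- tail(cars)
print(nrow(last_rows))

# Pass n explicitly to change how many rows are shown
print(tail(cars, n = 3))
```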
6. ____ can be used to view the distribution of a single variable and ____ can be used to view the relationship between two variables
View Answer
hist(), plot()
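A minimal sketch of both calls on the built-in `cars` dataset; the titles and axis labels are illustrative choices.

```r
# Distribution of a single variable: stopping distances in cars
h <- hist(cars$dist, main = "Stopping distance", xlab = "dist (ft)")

# Relationship between two variables: speed against stopping distance
plot(cars$speed, cars$dist, xlab = "speed (mph)", ylab = "dist (ft)")
```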
7. Consider the built-in R dataset cars. What is the median of the dist variable?
View Answer
36
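The value can be checked directly in base R (the variable name `dist_median` is just for illustration):

```r
# Median of the dist column in the built-in cars dataset
dist_median <- median(cars$dist)
print(dist_median)

# summary() reports the median alongside the other quartiles
print(summary(cars$dist))
```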
8. Using the head() function, identify the 8th row of the built-in cars dataset
View Answer
10 26 (speed, dist)
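The values 10 and 26 match row 8 of the built-in `cars` dataset (columns `speed` and `dist`). One way to pull that row out with `head()`, sketched here with an illustrative variable name:

```r
# head(cars, 8) returns the first 8 rows; the last of them is row 8
first_eight <- head(cars, 8)
row8 <- first_eight[8, ]
print(row8)
```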
9. In a tidy dataset, each ___ forms a row and each ___ forms a column
View Answer
Observation, Variable
10. Identify the function in the dplyr package that helps in previewing the data.
View Answer
glimpse()
11. A dataset with columns (country, disease, number of deaths) has the values Row 1: (CONGO, TB, 28), Row 2: (SPAIN, TB, 2), Row 3: (EGYPT, TB, 0). Is this a tidy or a messy dataset?
View Answer
Tidy data
12. filter() is for selecting columns and select() is for selecting rows
a) True
b) False
View Answer
False
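In dplyr the roles are the other way round: `filter()` keeps rows that match a condition and `select()` keeps columns by name. A small sketch on the built-in `mtcars` dataset, assuming the dplyr package is installed (the variable names are illustrative):

```r
# filter() keeps rows: only cars with 4 cylinders, all columns retained
four_cyl <- dplyr::filter(mtcars, cyl == 4)

# select() keeps columns: only mpg and cyl, all rows retained
two_cols <- dplyr::select(mtcars, mpg, cyl)

print(dim(four_cyl))
print(dim(two_cols))
```

The `dplyr::` prefix is used so that `filter()` is not confused with `stats::filter()`, which base R loads by default.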