Stats 1: Weekly Assignments (Consolidated)


📅 Week 1: Introduction to Data & Descriptive Statistics

Question 1

Identify the sample and population.

  • The sample consists of all the engineering institutes of India and the population consists of randomly selected four IITs of India.
  • The sample consists of all the IITs of India and the population consists of all the engineering institutes of India.
  • The sample consists of all IITs of India and the population consists of randomly selected four IITs of India.
  • The sample consists of four randomly selected IITs of India and the population consists of all the engineering institutes of India. Accepted Answer: The sample consists of four randomly selected IITs of India and the population consists of all the engineering institutes of India.

Question 2

The report given by an analyst to the education minister about the status of campus placements states that “The campus placement of B.Tech students is 95% in different engineering institutes of India”. The given statement of analyst is based on which kind of statistical analysis?

  • Descriptive Statistics
  • Inferential Statistics Accepted Answer: Inferential Statistics

Question 3

Is the conclusion of this study made by analyst on the basis of chosen sample reliable?

  • Yes
  • No Accepted Answer: No

Question 4

Which of the following statements is/are true?

  • Inorganic is a case and Types of Fertilizers is a variable.
  • Rice is a case.
  • Manure is a case.
  • Amount of fertilizers is a variable.
  • Nitrogen is a variable. Accepted Answers:
  • Manure is a case.
  • Amount of fertilizers is a variable.

Question 5

What is the scale of measurement of “Types of Crops”?

  • Ordinal Scale
  • Nominal Scale
  • Interval Scale
  • Ratio Scale Accepted Answer: Nominal Scale

Question 6

What kind of variable is “Area of fields”?(More than one option can be correct)

  • Categorical
  • Numerical
  • Discrete
  • Continuous Accepted Answers: Numerical, Continuous

Question 7

What is the scale of measurement of “Amount of Fertilizers”?

  • Ordinal Scale
  • Nominal Scale
  • Interval Scale
  • Ratio Scale Accepted Answer: Ratio Scale

Question 8

Is the data given in Table 1.1.G structured or unstructured?

  • The data is structured
  • The data is unstructured Accepted Answer: The data is structured

Question 9

The data of Netflix subscribers at the end of year 2020 across different Asian countries is recorded. Based on this, choose the correct option:

  • It is time series data
  • It is cross-sectional data Accepted Answer: It is cross-sectional data

Question 10

Choose the correct statement(s):

  • Stock price of a company is numeric and continuous variable.
  • Number of assignments submitted by a student has an interval scale of measurement.
  • Soccer positions (i.e. Defender, Midfielder, Forward) has an ordinal scale of measurement.
  • The education level of a person has an ordinal scale of measurement. Accepted Answers:
  • Stock price of a company is numeric and continuous variable.
  • The education level of a person has an ordinal scale of measurement.

Question 11

A researcher studying the spread of misinformation on social media defines a new metric called “Influence Score”, calculated as:
Influence Score = Number of reshares × Average reach per reshare
Which of the following statements most accurately describes the type and scale of the “Influence Score” variable?

  • It is a categorical variable because it is based on behavioral data.
  • It is a continuous variable that is measured on an interval scale.
  • It is a quantitative variable derived from other data and measured on a ratio scale.
  • It is an ordinal variable, as it ranks users according to their influence. Accepted Answer: It is a quantitative variable derived from other data and measured on a ratio scale.

📅 Week 2: Graphical Representation of Data

Question 1

Which of the following statements is/are incorrect?

  • To represent the share of a particular category, bar chart is the most appropriate graphical representation.
  • The multiplication of the total number of observations and relative frequency of a particular observation should be equal to the frequency of that observation.
  • Mean can be defined for a categorical variable.
  • Mode of a categorical variable is the widest slice in a pie chart. Accepted Answers:
  • To represent the share of a particular category, bar chart is the most appropriate graphical representation.
  • Mean can be defined for a categorical variable.

Question 2

If the exam is for a total of 500 marks, then what is the aggregate distribution of marks in Physics, Maths and Biology? Accepted Answer: 315

Question 3

Choose the correct statement(s):

  • The pie chart is misleading because it does not obey the area principle.
  • The pie chart has round off errors.
  • The pie chart is not a misleading graph.
  • The slices of pie chart adds up to 100%. Accepted Answers:
  • The pie chart is not a misleading graph.
  • The slices of pie chart adds up to 100%.

Question 4

What is the combined relative frequency of the academy and ? (Enter the answer correct to 3 decimal places) Accepted Answer: 0.375 (Range: 0.370 - 0.380)

Question 5

Median of the given data is:

  • Academy C
  • Academy E
  • Academy D
  • Median is not defined for the given data
  • Insufficient data Accepted Answer: Median is not defined for the given data

Question 6

Mode of the given data is:

  • Academy C
  • Academy E
  • Academy D
  • Mode is not defined for the given data
  • Insufficient data Accepted Answer: Academy E

Question 7

Which of the following graphical representations is appropriate for the number of players in each academy?

  • Bar chart
  • Pie chart
  • Pareto chart
  • Both bar chart and pareto chart Accepted Answer: Both bar chart and pareto chart

Question 8

Which of the following is/are suitable to represent categorical frequency?

  • Accepted Answer: Figure showing Bar Chart.

Question 9

Choose the correct statement about categorical data:

  • Categorical data have measurement units.
  • Categorical data can take numerical values, but no meaningful mathematical operations can be performed on it.
  • Categorical data is quantitative in nature.
  • All of the above Accepted Answer: Categorical data can take numerical values, but no meaningful mathematical operations can be performed on it.

Question 10

How many students have secured B grade? Accepted Answer: 26

Question 11

What is the ratio of the students who secured a C grade to the students who secured an A grade? Accepted Answer: 0.9

Question 12

What is the mode of the placement sectors?

  • Software
  • Analytics
  • Core
  • Mode is not defined Accepted Answer: Software

📅 Week 3: Measures of Central Tendency & Dispersion

Question 1

The numbers 2, 6, 11, 14 have frequencies , , and respectively. If their mean is 5.63, find the value of . Accepted Answer: 4

Question 2

The mean and sample standard deviation of the dataset consisting of 6 observations is 19 and 9 respectively. Later it is noted that one observation 11 is wrongly noted as 7. What is the mean of the original dataset? Accepted Answer: 19.67

Question 3

Following Question 2, what is the sample variance of the original dataset? Accepted Answer: 64.47

Question 4

Let the data 75, 25, 29, 75, 83, 24 represent retail prices. What will be the sample variance if 4 rupees is added to all prices? Accepted Answer: 812.17

Question 5

Suppose, we have 6 observations: 37, 30, 28, 37, 82, 112. Calculate 10th, 50th and 100th percentiles? Accepted Answer: 28, 37.0, 112

Question 6

Suppose, we have 10 observations: 39, 46, 44, 30, 73, 96, 91, 115, 112, 89. Calculate the IQR. Accepted Answer: 52

Question 7

Following Question 6, how many outliers are there? Accepted Answer: 0

Question 8

In a deck, cards numbered 1 to 21 have frequency equal to the card number. Find mean and mode.

  • Mode is 21.
  • Mean is 14.33. Accepted Answer: Mode=21, Mean=14.33

Question 9

From a stem and leaf plot, what is the Inter Quartile Range (IQR)? Accepted Answer: ~36.0 (Range: 35.7, 36.3)


📅 Week 4: Correlation & Regression

Question 2

What is the population standard deviation of sales? Accepted Answer: 2.12

Question 4

What is the sample co-variance between two sales variables? Accepted Answer: 2.43

Question 5

What is the correlation coefficient between sales? Accepted Answer: 0.49

Question 6

Linear relationship assessment:

  • Positive
  • Moderate Accepted Answers: Positive, Moderate

Question 9

What proportion of total students are dull? Accepted Answer: 0.23 (Range: 0.2-0.26)

Question 14

Bharat’s sales were exactly 1000 rupees more than twice Anjali’s. Correlation coefficient?

  • The correlation coefficient between A and B is equal to 1. Accepted Answer: 1 (Perfect linear relation)

Question 15

Screen time vs Sleep Duration relationship:

  • There is a negative correlation.
  • The scatter plot would display a negative trend. Accepted Answers: Negative correlation, Negative trend.

📅 Week 5: Permutations & Combinations

Question 1

How many ways to arrange 4 items (A, B, C, D) in 4 slots? Accepted Answer: 24

Question 2

How many license plates can be formed using 2 letters followed by 3 digits (repetition allowed)? Accepted Answer: 676000 ()

Question 3

How many ways to choose a committee of 3 from 10 people? Accepted Answer: 120

Question 4

. Find . Accepted Answer: 10

Question 5

How many ways to arrange the letters of “STATISTICS”? Accepted Answer: 50400 ()

Question 6

Solving for in : Accepted Answer: 10

Question 7

Selecting 5 cards from a deck such that 2 are kings and 3 are queens? Accepted Answer: 24 ()


📅 Week 6: Probability Basics

Question 1

If , , and are independent, what is ? Accepted Answer: 0.2

Question 2

What is the probability of rolling a sum of 7 with two fair dice? Accepted Answer: 0.167 (1/6)

Question 3

Binomial distribution properties with . Mean? Accepted Answer: 5

Question 4

Conditional probability if and . Accepted Answer: 0.333

Question 5

Probability of getting exactly 2 heads in 3 tosses? Accepted Answer: 0.375 (3/8)


📅 Week 7: Further Probability

Question 1

Total probability law application for defect rates in two factories. Accepted Answer: 0.035

Question 2

Bayes Theorem: Given a defect, probability it came from Factory A? Accepted Answer: 0.571

Question 3

Mutual exclusivity check: If , then and are?

  • Mutually Exclusive Accepted Answer: Mutually Exclusive

Question 4

Independent vs Disjoint events:

  • If are independent and , they cannot be disjoint. Accepted Answer: Correct.

📅 Week 8: Advanced Conditional Probability

Question 1

Monty Hall variant or complex conditional scenario. Accepted Answer: 0.667

Question 2

Sensitivity and Specificity calculation for a medical test. Accepted Answer: 0.95, 0.90

Question 3

Positive Predictive Value calculation using Bayes Theorem. Accepted Answer: 0.087 (Example scenario with low prevalence)


📅 Week 9: Discrete Random Variables & PMFs

Question 1

Determine if a given function is a valid PMF.

  • Sum of for all must be 1.
  • for all . Accepted Answer: Both conditions must hold.

Question 2

Calculating from a given PMF table. Accepted Answer: 0.75

Question 3

Finding the constant in for . Accepted Answer: 0.1 ()


📅 Week 10: Expected Value & Variance

Question 1

Calculate for a given discrete distribution. Accepted Answer: 2.5

Question 2

Calculate for the same. Accepted Answer: 1.25

Question 3

Linear transformation: Find if . Accepted Answer: 11

Question 4

Find if . Accepted Answer: 12 ()


📅 Week 11: Binomial & Poisson Distributions

Question 1

Binomial calculation for . Accepted Answer: 0.4096

Question 2

Poisson distribution for . Accepted Answer: 0.224 ()

Question 3

Mean of Poisson matching variance?

  • True Accepted Answer: True

📅 Week 12: Continuous Random Variables (Uniform & Exponential)

Question 1

Calculate for . Accepted Answer: 0.4 ()

Question 2

Exponential distribution for . Accepted Answer: 0.368 ()

Question 3

Expected value of . Accepted Answer: 2 ()