Open In App

Collection and Presentation of Data

Last Updated : 10 May, 2022
Improve
Improve
Like Article
Like
Save
Share
Report

We come across a lot of information every day from different sources. Our newspapers, TV, Phone and the Internet, etc are the sources of information in our life. This information can be related to anything, from bowling averages in cricket to profits of the company over the years. These facts and figures are often numerical and are called Data. Statistics is the study of data. Let’s look into this in detail. 

Statistics – Collection and Presentation of Data

Before going into Statistics, first, let’s define what is Data. 

“Data are units of information, often numeric, collected through observation.” 

It is plural form of the Latin word “Datum”.

Our world has become very information-oriented in the past two decades. So, it becomes essential for us to extract meaningful information out of data. For that we need statistics. Let’s see what statistics mean in formal terms. 

Statistics is derived from Latin word “Status” which means “a state”. It concerns with the nature, meaning and distribution of the data. 

Collection of Data

Collection of data refers to collecting information about something with an objective to analyze it or extract some meaningful information from it. Some examples of activities involving the collection of data are: 

  1. Students collecting data from their localities about the number of people with Covid Vaccines.
  2. A Football fan collecting information about the goals scored by his favorite player.
  3. A record company collecting information about album sales by their artists.

Types of Recorded Data

Most of the time when we collect data for our experiment with an objective. It usually falls into one of these two categories: 

  1. Categorical Data
  2. Numerical Data

Categorical Data

This data represents the characteristics of something entity. For example, if we are collecting data about some people. Categorical data related to this information might be, gender of the person, marital status, etc. These things will have values that are not numerical, often “Yes/No” or in this case “Male/Female”. Since they are not numerical, they cannot be added together. 

Numerical Data 

This data comes out of measurement and is numerical in nature. For example, Weight of the person, stock prices, marks of students of class XII, etc. This data is also called quantitative data. It can be broken down further into types: 

  1. Continuous Data
  2. Discrete Data

Continuous Data: This data can take any value between intervals. The number of possible values for this data cannot be counted. For example Length of a ruler can take any length between 0-100cm. It can be either 30cm, 30.11cm and so on. There are infinitely many possible values. 

Discrete Data: This data takes only certain values. For example: If a coin is tossed three times, and we want to count the number of heads. There are only a handful of values that are possible. 0,1,2 or 3. It cannot take 2.2 or any other value. So, there are only finite possible values. 

Presentation of Data

After collecting the data, we need to present it in a meaningful way. Let’s take an example, 

Suppose we have the data of heights of students in a class, 

140, 161, 152, 184, 135, 168 and 144.

We need to answer the following questions related to the data: 

  1. What is the height of the longest student in the class?
  2. What is the height of the shortest student in the class?
  3. What is the average height?

It is a little difficult to analyze the data in this format. The data in the form is called raw data. Analyzing the data in this form might take more time if the data is big. It can be made a little easier if sort the data in ascending or descending order.  Thus, in this way, the presentation of data affects the information and the time taken to extract it from the data. 

Suppose if this data was even bigger, then it would be very difficult to organize the data in sorted order. In such cases, we might use a frequency table. Let’s see this through an example. 

Un-Grouped Frequency Distribution

In this type of frequency table, we consider the values as it is and then count their number of occurrences in the data. We don’t group the data. Let’s see this through an example. 

Question: Let’s say we have marks of students of class XII. The marks are out of 40. 

20  21 29 15 7 10
31 40 24 5 11 13 20
24 27 13 15 38 33 29

Represent this data using a frequency table. 

Solution: 

Let’s take marks of some student in one column and frequency of such marks in another column. 

Marks Frequency
5 1
7 1
8 1
10 1
11 1
13 2
15 2
20 2
21 1
24 2
29 1
33 1
38 1
40 1

Notice that in this table, we have not grouped the data instead we have taken exact values and their frequency. So, this type of representation is called ungrouped frequency distribution. 

Grouped Frequency Distribution

The previous kind of representation is definitely an improvement over previous representations but as seen in the above example, tables can get pretty big in such representations. Tally Marks and grouping can also be used to represent this data. 

Question: We have the data for the number of covid cases on a particular day in 20 cities. 

10 21 25 33
15 8 16 20
0 5 38 28
5 0 16 23

Represent this data using a frequency table. 

Solution: 

In the previous example we saw that ungrouped frequency distribution is cumbersome and very long to look at. So now, we will divide the data into groups. This kind of frequency table representation is called grouped frequency representation. 

Let’s divide the numbers of cases in the groups like, 0-5, 5-10, 10-15 … and so on. 

Then the frequency table will become, 

Group Frequency
0-5 2
5-10 3
10-15 1
15-20 3
20-25 2
25-30 2
30-35 1
35-40 1

The intervals like 0-5, 5-10 .. And so on given in the above example are called class intervals. The larger number is called higher limit and the lower number is called the lower limit. 

Let’s see some sample problems on these concepts 

Sample Problems

Problem 1: The table below represents the data. Represent this data in the form of suitable frequency distribution. 

3 4 3 3
2 4 4 3
2 2 2 3

Solution: 

We can see from the data given above, that there are only three values – 2,3 and 4. These values occur multiple times throughout the data. Since there are very less number of values, we can represent this kind of data in the form un-grouped frequency table. 

Value Frequency
2 4
3 5
4 3
Total – 12

Problem 2: The data given below represents the blood groups of the 20 students of class XI. 

O AB A
AB AB O B
A A O B
B O B A
B AB O B

Represent the data given above in the table in the form of a frequency table. Which of the following blood group has the highest frequency among the students?

Solution: 

We know there are four types of blood groups in the table. 

O, A, AB and B

So, we will use ungrouped frequency distribution table to represent the data. 

Blood Group Frequency 
O 5
A 5
AB 4
B 6
Total  20

From the frequency distribution table we can tell the B is the blood group which most commonly occurring in students. 

Problem 3: The table represents the weights of the students of class X. 

60 73 62 54
48 88 49 52
55 60 62 63
77 47 65 59

Answer the following questions: 

  1. What is the range in which most students lie? 
  2. Suppose students weighing more than 70 are considered overweight and those weighing less than 50 are considered as underweight. How many such students are there in the class? 

Solution: 

Let’s make a grouped frequency distribution table for this data. 

Assuming intervals like 0-10,10-20…and so on. Let’s divide the data into these intervals are count the frequency. 

Weight Group Frequency
0-10 0
10-20 0
20-30 0
30-40 0
40-50 3
50-60 4
60-70 6
70-80 2
80-90 1
Total  16

This above table represents a grouped frequency table. Now answering the questions. 

1. Most students lie in the range from 60-70. 

2. For overweight students, we need to count the number of students with weight greater than 70. It can be observed from the table that there are three such students. 

For underweight students, the number students with weight less than 50 are also three students. 

Problem 4: Three coins are tossed 20 times. The number of heads that occurred each time is recorded and given in this data below. Prepare a frequency distribution for the given data. 

0 2 1 3
2 1 1 1
3 2 0 3
2 3 2 2
2 0 1 2

Solution: 

We know there are maximum of three heads possible at each turn in this experiment. So we can actually make an ungrouped frequency distribution for such data

Number of Heads Frequency
0 3
1 5
2 8
3 4
Total  20

Thus, the table above represents the frequency table for this data. 



Previous Article
Next Article

Similar Reads

Data Compilation and Presentation| Practical Work in Geography Class 12
In this article, we will delve deep into the topic of "Data Compilation and Presentation" from Chapter 1 of the NCERT Class 12 Practical Work Geography book. These notes are specially curated by an expert team at GeeksforGeeks for all the students. Table of Content Data Compilation and PresentationData Compilation and Presentation: Short NotesData
5 min read
Layout and Views in Presentation Tool
Microsoft PowerPoint provides different types of tools to make presentations presentable and interactive for different purposes like for business, for class, for projects, etc. Layout and Views tools are one of the presentation tools. The View in power points is the look of the working space that helps you to modify your slides as you desire. Slide
5 min read
How to Add Audio to Powerpoint Presentation
PowerPoint is a presentation software program of the Microsoft Office package. PowerPoint uses a graphical approach to presentations in the form of slide shows that accompany the oral delivery of the topic. This program is widely utilized in business and classrooms and is an efficient tool when used for training purposes. It provides “Power to your
2 min read
How to Edit a Powerpoint Presentation?
Using the Microsoft PowerPoint tool, we can create professional presentations (including slides) that may be presented through a projector or on a screen (computer/Laptop). A PowerPoint presentation is an effective technique to transmit information. To a big audience, usually in the form of an outline. PowerPoint presentations are popular with user
6 min read
Types of Presentation Tools in MS Office
Presentations are a crucial aspect of communicating ideas and information, whether it be in the classroom, the boardroom, or any other setting. With the growing reliance on technology, finding the right presentation tool has become increasingly important. As a computer science student, you have a unique understanding of the capabilities and limitat
6 min read
Class 8 RD Sharma Solutions - Chapter 23 Data Handling I (Classification And Tabulation Of Data) - Exercise 23.1
Question 1: Define the following terms: (i) Observations Solution: Observation is the value of a particular variable at a particular period.OREach entry in the given data is called an observation. (ii) Raw data Solution: Raw data is the data collected in its original form.ORRaw data is a collection of observations by an observer. (iii) Frequency of
7 min read
Class 8 RD Sharma Solutions - Chapter 23 Data Handling I (Classification And Tabulation Of Data) - Exercise 23.2
Problem 1: The marks obtained by 40 students of class VIII in an examination are given below: 16, 17, 18, 3, 7, 23, 18, 13, 10, 21, 7, 1, 13, 21, 13, 15, 19, 24, 16, 3, 23, 5, 12, 18, 8, 12, 6, 8, 16, 5, 3, 5, 0, 7, 9, 12, 20, 10, 2, 23. Divide the data into five groups namely 0-5, 5-10, 10-15, 15-20, and 20-25, and prepare a grouped frequency tabl
6 min read
Class 8 RD Sharma- Chapter 25 Data Handling III (Pictorial Representation Of Data As Pie Charts Or Circle Graphs) - Exercise 25.2
Question 1. The pie-chart given in the following represents the expenditure on different items in constructing a flat in Delhi. If the expenditure incurred on cement is Rs 112500, find the following:(i) Total cost of the flat.(ii) Expenditure incurred on labour. Solution: (i) Expenditure incurred on cement = [Tex]\frac{angle}{360°}[/Tex] x total co
5 min read
Class 8 RD Sharma Solutions - Chapter 24 Data Handling II (Graphical Representation of Data as Histograms) - Exercise 24.1 | Set 1
Question 1. Given below is the frequency distribution of the heights of 50 students of a class:Class Interval 140 – 145145 – 150150 – 155155 – 160160 – 165Frequency81218105Draw a histogram representing the above data. Solution: To draw a histogram first construct x-axis and y-axis, where the x-axis represents class interval and the y-axis represent
3 min read
Class 8 RD Sharma Solutions - Chapter 24 Data Handling II (Graphical Representation of Data as Histograms) - Exercise 24.1 | Set 2
Chapter 24 Data Handling II (Graphical Representation of Data as Histograms) - Exercise 24.1 | Set 1Question 7. Draw a histogram to represent the following data:Monthly Salary (in Rs) No. of Teachers5600 – 570085700 – 5800 45800 – 5900 35900 – 600056000 – 610026100 – 620036200 – 630016300 – 6400 2 Solution: To draw a histogram first construct x-axi
5 min read