AP Computer Science Principles – Unit 2.3

Data Collection & Visualization – Class Notes

1. What Is Data Collection?

Data collection is the process of gathering information so it can be analyzed and used to make decisions. In computer science, data is collected from many sources, such as:

Why it matters: Modern computing relies on data to understand behavior, solve problems, and design better systems.

2. Why Do We Collect Data?

Data helps us:

3. Identifying Patterns and Trends

Patterns and trends are the key outcomes of data analysis:

A. Patterns

A pattern is a repeated or predictable form in the data.

Examples:

B. Trends

A trend is a change in data over time.

Examples:

Why They Matter in CSP

Patterns and trends help computer scientists:

4. Data Visualization

Data visualization means turning data into pictures so it’s easier to understand.

Visualization helps us:

Good visualizations can reveal things that raw data cannot.

5. Tools for Data Visualization

A. Charts

Charts make numbers easier to understand. Common types:

Chart Type Best For
Bar Chart Comparing categories
Line Graph Showing change over time
Pie Chart Showing percentages of a whole
Histogram Showing distribution of data
Scatter Plot Showing relationships between two variables

B. Graphs

Graphs show relationships between data points.

Examples:

C. Simulations

Simulations model real-world processes when collecting actual data is too expensive, dangerous, or slow.

Examples:

Benefits of simulations:

6. Using Visualization to Make Decisions

Visualization helps users and organizations:

Example: A school visualizes student performance data and sees a drop on Fridays, so they adjust homework due dates.

7. Key AP Exam Takeaways