Scatter Plot Calculator

Enter your X and Y data pairs as comma-separated values to generate a scatter plot with a linear regression line. Customize your axis labels and chart title, then get back the regression equation, correlation coefficient (r), slope, and intercept — all visualized on an interactive chart.

Enter the X (independent variable) values separated by commas.

Enter the Y (dependent variable) values separated by commas. Must match the number of X values.

Results

Correlation Coefficient (r)

--

Slope (m)

--

Y-Intercept (b)

--

R² (Coefficient of Determination)

--

Number of Data Points

--

Regression Equation

--

Scatter Plot with Regression Line

Results Table

Frequently Asked Questions

What is a scatter plot and when should I use one?

A scatter plot is a graph that displays individual data points as dots plotted along two axes — one for X (independent variable) and one for Y (dependent variable). It's used to visualize the relationship or correlation between two numeric variables, identify trends, spot outliers, and assess whether a linear model is appropriate.

How do I create a scatter plot with a regression line using this tool?

Enter your X values and Y values as comma-separated numbers in their respective fields. Make sure both lists have the same number of values. Check the 'Show Linear Regression Line' option, then click Calculate. The tool will render the scatter plot and overlay the best-fit line automatically.

How can I find the regression equation from my data?

The regression equation is displayed in the form y = mx + b, where m is the slope and b is the y-intercept. These values are calculated using the least squares method, which minimizes the sum of squared differences between actual and predicted Y values. You'll find the full equation in the results panel after calculating.

What does the correlation coefficient (r) mean?

The correlation coefficient r measures the strength and direction of the linear relationship between X and Y. Values close to +1 indicate a strong positive correlation, values near −1 indicate a strong negative correlation, and values near 0 suggest little to no linear relationship. R² tells you the proportion of variance in Y explained by X.

What are residuals and why are they useful?

A residual is the difference between an observed Y value and the Y value predicted by the regression line (y − ŷ). Residuals help you assess how well the regression line fits the data. Large or patterned residuals suggest the linear model may not be the best fit, and a non-linear model might describe the data better.

What does it mean if my data has outliers?

Outliers are data points that deviate significantly from the overall pattern of the scatter plot. They can strongly influence the slope and intercept of the regression line. If a point has a very large residual, it's likely an outlier. Always investigate whether outliers are data entry errors or genuinely extreme observations.

Can this scatter plot calculator handle non-linear data or multiple regression?

This tool performs simple linear regression between one X variable and one Y variable. If your data follows a curved pattern (e.g. exponential or polynomial), the linear regression line will be a poor fit, which the residuals will reveal. For non-linear or multiple regression analysis, specialized statistical software would be more appropriate.

How many data points do I need for a meaningful scatter plot?

A minimum of 3–5 data points is required to compute a regression line, but meaningful statistical conclusions generally require at least 10–20 data points. With very few points, the regression equation may fit the sample perfectly but generalize poorly. More data leads to more reliable estimates of slope, intercept, and correlation.

More Math Tools