sampcompR: A package for the comparison of samples
sampcompR.Rd
Easily analyze and visualize differences between samples (e.g., benchmark
comparisons, nonresponse comparisons in surveys) on three levels. The
comparisons can be univariate, bivariate or multivariate. On univariate
level the variables of interest of a survey and a comparison survey
(i.e. benchmark) are compared, by calculating one of several difference
measures (e.g., relative difference in mean), and an average difference
between the surveys. On bivariate level a function can calculate significant
differences in correlations for the surveys. And on multivariate levels a
function can calculate significant differences in model coefficients between
the surveys of comparison. All of those differences can be easily plotted
and outputted as a table. Visualization is based on
ggplot
and can be edited as other plots of
ggplot afterwards. For more detailed information on the methods and
example use see: Rohr, B., Silber, H., & Felderer, B. (2024). „Comparing the
Accuracy of Univariate, Bivariate, and Multivariate Estimates across
Probability and Non-Probability Surveys with Population Benchmarks“
https://doi.org/10.31235/osf.io/n6ehf.
sampcompR functions
- uni_compare
Compare Datasets Univariate and Plot Differences
- plot_uni_compare
Plot uni_compare objects
- uni_compare_table
Get a table output of a uni_compare object
- R_indicator
Calculate the R_indicator for several surveys
- biv_compare
Compare Datasets Bivariate and Plot Differences
- plot_biv_compare
Plot biv_compare objects
- biv_compare_table
Get a table output of a biv_compare object
- multi_compare
Compare two Datasets on a Multivariate Level (Any GLM Model)
- plot_multi_compare
Plot multi_compare objects
- multi_compare_table
Get a table output of multi_compare objects
- multi_compare_merge
Combine two multi_compare objects, to plot them together
- descriptive_table
Get a Descriptive Table for Every Data Frame
- dataequalizer
Equalize dataframes
uni_compare function
uni_compare Returns data or a plot showing the difference of two or more
data frames The differences are calculated on the base of
differing metrics, chosen in the funct argument.
Results can be visualized using plot_uni_compare
.
biv_compare function
biv_compare Returns data or heatmap of difference between two or
more data frames, by comparing their correlation matrices. The comparison is
based on Pearson's r, calculated using the rcorr
function.
Results can be visualized using plot_biv_compare
.
multi_compare function
multi_compare Returns data of difference between two data frames
on a multivariate level. Similar (multivariate) regression models are
compared between the surveys. Only GLM models are possible. Results can be
visualized using plot_multi_compare
.
dataequalizer function
dataequalizer compares two data frames and looks if data frames contain columns with the same name. A copy of y is returned, containing only columns named identical in x and y data frames. The function is mainly used in the other functions of the package.
_PACKAGE