Coder Social home page Coder Social logo

hrmazumder / probability-and-nonprobability-sampling Goto Github PK

View Code? Open in Web Editor NEW
0.0 1.0 0.0 879 KB

Comparison of probability (simple random sampling) and non-probability sampling (convenience sampling) methods. R code for simulating a convenient sample, simple random sample, and compare bias and relative efficiency of corresponding sample means.

R 100.00%
probability-sampling nonprobability-sampling convenience-sampling simulation-study-in-r relative-efficience

probability-and-nonprobability-sampling's Introduction

A Simulation Study to Compare the Bias and Efficiency of the Sample Mean Obtained from Simple Random (SRS) and Convenience Samples (CS)

Non probability sampling is a typically cheap, easy to implement and often used sampling method in health related studies, such as in clinical trials where volunteers are used. Although non probability sampling is very practical, it is known that the samples might not be representative and it can cause sampling bias. There are very few studies in literature that compare probability sampling to nonprobability sampling via simulations. In this project, we assess the impact of convenience sampling on the estimation of the population mean, especially in regards to its bias and efficiency, compared to simple random sampling using an extensive Monte Carlo study.

Simulation Study Design:

Output from the Simulation Study:

Discussion from the Output:

• Increasing the sample size has no effect on the bias of the sample mean (zbar) from CS • Increasing the 𝛼 value decreases the bias in the sample mean (zbar) 𝑧obtained from CS. This is due to the fact that increasing 𝛼 values increase the selection probability • Increasing the 𝛼 values increase the REs, i.e. CS becomes competitive with SRS as 𝛼 values increase • Increasing the 𝛽 value, however causes an increase in the bias of the sample mean (zbar) obtained from CS • Similarly, increasing the 𝛽 value, decreases the RE (i.e. SRS’s MSE gets better) • While increasing the 𝛾 values also causes an increase in the bias of the sample mean (zbar) obtained from CS, this increase stabilizes quickly. However, the REs get smaller with higher 𝛾 values • We want to highlight that for very high 𝛼 values, CS sampling becomes pretty competitive with SRS • For such high 𝛼 values the biases from CS and SRS become very close • For small 𝛾 values RE values become greater than 1

Conclusion:

Our study confirms that the bias and the MSE of the sample mean from CS is higher than that of SRS. However, we observed that there are cases, such as when there is a correlation between an outcome and a covariate where the covariate is also correlated with selection probability, if this probability is high, then CS becomes competitive with SRS. Since the distinction between probability and non probability sampling has been rapidly diminishing due to high nonresponse rates (general) and reduced coverage (phone surveys). Besides, as long as some adjustment techniques are applied (such as post stratification) the selection bias from CS can be eliminated. Thus, CS offers a good alternative within the non probability sampling methods.

Reference: https://ww2.amstat.org/meetings/jsm/2021//onlineprogram/AbstractDetails.cfm?abstractid=318968

Presented in Joint Statistical Meetings (JSM) 2021, American Statistical Association.

**My contribution: writing the entire R code for analysis (SRS, CS, Simulation Study and Making Plots) and providing with insights from analysis!

probability-and-nonprobability-sampling's People

Contributors

hrmazumder avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.