Skip to main content

Questions tagged [survey-weights]

survey weights are used when data are collected according to a probability sampling design with unequal probabilities of selection and/or response

1 vote
0 answers
23 views

Combining Survey Weights

I have an annual survey that is somewhat complex in design. Sample frames are pulled quarterly and overlap. Each quarter's sample is removed from the subsequent quarter's frame. Samples are stratified ...
ReliableResearch's user avatar
0 votes
0 answers
33 views

Is bootstrapping inherently Frequentist? If so, how do we do a Bayesian non-parametric two-sample test?

I normally use frequentist statistics but I now want to use Bayesian statistics as I want to carry out a two-sample (randomised control trial) test that includes prior information. I have an existing ...
Amorphia's user avatar
  • 913
0 votes
0 answers
13 views

Do replicate weight, when the are available, supersede all the design-based procedures and analysis?

If the agency that conducted a survey and releases micro data at the individual or household level also supplies replicate weights for that sample, do these weights completely supercede any need or ...
andrewH's user avatar
  • 3,157
0 votes
0 answers
12 views

Whether to specify ddf in confint.svyglm()

I have a main question about whether to specify ddf in confint.svyglm(). Using the two specifications below generate slightly ...
Guoqiang Zhang's user avatar
0 votes
0 answers
23 views

Weighing Data Issue

I am looking at e-cig prevalence within a city. I used surveys to collect data from residents, and I have a query around weighing data. I have made the assumption, due to over and underrepresentation ...
Aidan's user avatar
  • 1
0 votes
0 answers
37 views

Post-hoc tests of interaction for a negative binomial regression with survey weights?

I ran a negative binomial regression with survey weights using the R package survey and the function svyglm() and observed a ...
Chernoff's user avatar
  • 308
2 votes
2 answers
77 views

Weighing Questions within a Survey

I work for a public child welfare agency. We have something called a level of care tool that categorizes children based on the amount of care they need. This helps us identify appropriate care options ...
ra_learns's user avatar
2 votes
0 answers
20 views

Use calibration weights to correct for unit non-response bias?

I have a question about how calibration weights can be used to sufficiently correct for unit non-response bias. Suppose the sample is s and the response set is r. Calibration is applied to the ...
Guoqiang Zhang's user avatar
1 vote
1 answer
154 views

How do I use something like predict.glm (in R) with a svyglm model and why don't my predictions match my data?

I'd like to estimate "cost" using some covariates with a weighted gamma model using svyglm. The weights sum to 1, and there are about 10,000 rows in the dataframe df total, with columns ...
Mark's user avatar
  • 203
4 votes
1 answer
101 views

Are there coventions on reporting weighted sample sizes?

Suppose I analyze survey data and calculate weighted means. Should I report the sample size if it differs from the unweighted sample size, as would be the case with non-normalized weights? It seems ...
mb1994's user avatar
  • 41
1 vote
0 answers
44 views

R survey package: compare weighted prevalence with literature result

I have a large dataset (>3000 cases) with results of serological tests. Based on these results i want to calculate the weighted prevalence of the disease. The data is categorical (0 = disease ...
Zoetzurechucky's user avatar
1 vote
0 answers
61 views

Multilevel Regression and Poststratification (MRP) weighting in the opposite direction to Poststratification alone

We are using MRP to derive test norms for an IQ test (Culture Fair Test, CFT) based on the TwinLife data. We adjust for age, sex, education, and migration background. Although a probability sample, ...
balout's user avatar
  • 11
0 votes
0 answers
30 views

Can I add a variable to a complex sample, and run a regression?

In a survey, a complex sample was collected, and the sample was designed to provide estimates at national level. In other words, individuals from one state were more likely to be sampled due to ...
Oalvinegro's user avatar
0 votes
0 answers
53 views

Adjusting the sampling probability during sampling

Let $n$ individuals in a statistical population be identified by $p.id \in \{1, 2, ..., n\}$ where the order is randomised. Assume, for simplicity, that all individuals are included in the sampling ...
LBogaardt's user avatar
  • 582
0 votes
0 answers
30 views

Complex survey design with multiple waves

The organization I work for has collected data from individuals in multiple waves. Their goal was to collect 333 individuals in 6 different groups (genderXgroup). If the first Wave did not reach 333 ...
StatsAreHard's user avatar
0 votes
0 answers
16 views

Sampling correlated clusters and calculating their weights

In a survey I want to do I have the following problem: I have 20 clusters, and I need to choose 5. Each cluster has 10 observations, and I need to choose 2. Lets say that the probability of choosing a ...
Nick's user avatar
  • 1
0 votes
1 answer
34 views

How to treat age-eligibility thresholds in household surveys (e.g. HRS)?

Most household surveys have age-eligibility thresholds. The HRS interviews individuals aged 51 and older, plus their spouse (if any) using PPS sampling. Do I need to drop individuals who are younger ...
cascom's user avatar
  • 41
1 vote
0 answers
45 views

Unbiased estimate of mean test score of pupils in a country (sampling frame of schools is avaible only)

My primary goal is to get unbiased estimate of mean test score of every pupil in a country. I have no sampling frame of all pupils to randomly sample from. But I have a sampling frame for every school....
Nothingman's user avatar
1 vote
1 answer
137 views

When to use replicate weights in complex survey analysis

I am curious about when it is recommended to use replicate weights in survey analysis. I compared the usual survey analysis with using replicate weights, as illustrated below. Based on the paper "...
Guoqiang Zhang's user avatar
0 votes
0 answers
14 views

Study changes of association between an independent variable and a categorical outcome with more than 2 categories

Suppose I would like to study the association between sex (male and female; independent variable) and taste preference (sweet, bitter, and sour; dependent variable). I have three cross-sectional ...
Guoqiang Zhang's user avatar
2 votes
1 answer
72 views

Different ways to define survey design object under MCAR assumption

I have a stratified random sample, and would like to conduct complete-case analysis, assuming Missing Completely At Random. However, I find that there seem to be two ways to define survey design ...
Guoqiang Zhang's user avatar
1 vote
2 answers
122 views

Is it reasonable to subset a survey design object by dependent (outcome) variable and fit a weighted logistic regression model?

I would like to study which factors are associated with an outcome which has more than two categories. After considering multinomial logistic regression model (which I find is very challenging to ...
Guoqiang Zhang's user avatar
4 votes
1 answer
96 views

Is it valid to compute survey weights using the distribution of a hypothetical/"imaginary" population?

I'm informally reviewing a study by a coworker, but as I'm not a specialist of survey weighting procedures, I need a second opinion about something I disagree with him. He collected a convenience ...
Daniela's user avatar
  • 57
0 votes
0 answers
55 views

Should we use survey::svyglm() to compute correct standard errors with WeightThem?

I need to use multiply imputed datasets and weights before using linear models (linear, logistic and negative binomial). I have 2 sources of informations that point to different directions to compute ...
Charly Marie's user avatar
2 votes
1 answer
69 views

writing a simple linear (and logistic) regression equation that incorporates survey weights

I am running a basic linear (and then logistic) regression that incorporates basic survey weights. I would like to specify the general regression equation with the survey weights for a manuscript. But ...
May Smith's user avatar
0 votes
0 answers
53 views

Longitudinal survey data weights

I'm dealing with a longitudinal survey data now. Do we need to use survey sample weights while running panel data models (e.g., Fixed Effects, Random Effects)? In other words, would it be fine if I ...
Greenhill's user avatar
  • 223
1 vote
1 answer
56 views

Is it possible to use poststratification when some observations have missing values on the variables used as strata?

This is a theoretical question, so I don't have data to share. Let's say I know the percentage of men and women in my population of interest, as well as the distribution of occupations and age ...
Cavdi's user avatar
  • 13
1 vote
1 answer
59 views

How to interpret survey data with demographic information?

I have little statistical experience, but am helping to run a community needs survey for my organization. Many of the questions have a yes/no or multiple choice answer format. For example, "Do ...
April's user avatar
  • 11
0 votes
0 answers
12 views

Longitudinal sample with varying levels of non-response

I'm using answers to a specific question from the General Social Survey ("do you feel rushed"?) to compare statistics over multiple years. The answers are "Always" "Sometimes&...
ryan_coogler's user avatar
1 vote
1 answer
77 views

Domain (subgroup) estimation in a stratified random sample

I want to ask a question about domain estimation (i.e., estimation of a parameter among subpopulations) in a stratified random sample. It seems to me that in a stratified random sample, domain ...
Guoqiang Zhang's user avatar
0 votes
0 answers
128 views

How can I reweight survey data

I have survey data from a complex survey with stratification, weights and clustering. I'm using the survey package in R to run regressions: ...
dash2's user avatar
  • 246
1 vote
1 answer
61 views

Variance test with survey weights

When not working with surveys, you do a variance test with var.test(). What do you do when you want to account for complex survey design?
Santiago Valdivieso's user avatar
1 vote
0 answers
23 views

Is there relationship between propensity score based causal inference and sampling weights?

Consider observational study with single outcome $Y$, single covariate $X$ and treatment assignment variable $W$. Under unconfounded treatment assignment assumption, $E_{sp}[Y(1)]=E[\frac{Y_i^{obs}W_i}...
user45765's user avatar
  • 1,445
0 votes
1 answer
41 views

Post stratification on a twophase object in the R survey package

I am working with NHANES survey data and I am trying to use a twophase survey design along with calibration to adjust for item non-response. For now, I am following an example from Lumley's Survey ...
tbuckley's user avatar
0 votes
1 answer
61 views

Can I make a proportional-to-size without replacement sample (PPS WOR) self weighted?

Let's say I have 100 schools and each has a different number of students. I want to estimate which % of students are in schools with electricity. Simulation and theory indicate it is more efficient to ...
Fernando Irarrázaval G's user avatar
0 votes
0 answers
93 views

Weighted logistic regression for complex survey design

I want to run a logistic regression on my dataset in R. I want to test the probability of which direction a fish is facing depending on my variables. I am considering using a weighted logistic ...
Dominique Turney's user avatar
1 vote
0 answers
198 views

Nb/Poisson regression with weighted survey data resulting in counts with decimals

I am analysing suicide counts and thus it seems appropriate to use Nb/Poisson regression. However, my counts come from survey data and are only whole numbers when unweighted. Once I realise the ...
Anthony Fish Hodgson's user avatar
2 votes
1 answer
171 views

Appropriate way to use post-stratification weights when running statistical tests SPSS

I have used Complex Samples in SPSS (and SUDAAN in SAS, Survey in R) when working with survey data that were collected using a sampling design that was not random. For example, when an oversample was ...
Brett Wyker's user avatar
1 vote
1 answer
615 views

how to understand the weights in PSM?

When using propensity score matching or weighting, a column of weights is generated that is used to estimate the effect of interest. According to a blog I read, there are three types of weights ...
Plumber's user avatar
  • 33
2 votes
0 answers
30 views

Where does weight factors come into play with real data (in R)?

I do understand the IMHO simple concept using weights to fit a sample to another (real) population. Calculating the weight factors is not hard. Regarding to my MWE there maybe is a more elegant R way ...
buhtz's user avatar
  • 282
2 votes
1 answer
85 views

Post-stratification with missing subpopulations in the survey

I have some survey data on a population described by age, gender, and weight. It’s quite skewed so I want to reweight it to a known target population (a larger ...
cgreen's user avatar
  • 1,002
0 votes
1 answer
341 views

Calculate risk ratio in weighted population

I have a propensity score weighted population (using IPTW) and I want to compute risk ratios on my weighted population. For this, I am using a weighted Poisson regression. Let's suppose that "...
user19745561's user avatar
3 votes
0 answers
85 views

What is the (Ratio estimator for the) covariance of two weighted means? [closed]

In a previous question I've asked How to estimate the (approximate) variance of the weighted mean?, specifically, how to prove the following formula: $$ \widehat{\sigma_{\bar{y}_w}^2} = \frac{1}{(\sum{...
Tal Galili's user avatar
  • 21.8k
5 votes
1 answer
168 views

svychisq with statistic="Chisq" vs. statistic="adjWald"

I'd like to run analyses using Rao-Scott Chi-squared test using svychisq to account for survey weights. It seems that statistic="Chisq" gives the Rao-Scott version, however, now that I'm ...
AskingSomeQuestions's user avatar
5 votes
1 answer
167 views

Why does the survey package in R and SPSS complex samples add-on give different standard errors?

I was comparing results that I generated in R for complex survey analysis using the survey package to results from SPSS using the complex samples analysis add-on. The sample size is large ~ N=5500 ...
s.stats's user avatar
  • 477
4 votes
1 answer
206 views

Generate nonnegative variates with mean 1 and specified variance-covariance

Problem In several applications in surveys, it would be helpful to be able to generate a set of $R$ $n$-dimensional variates with the following properties: Has mean vector $1$ Has a specified ...
bschneidr's user avatar
  • 452
0 votes
0 answers
49 views

Confidence intervals for binary proportions in a subdivided population

I am trying to estimate a confidence interval for the population proportion level of a binary (yes / no) variable. The population comes in three segments - easy to sample (I have a sample); hard to ...
Amorphia's user avatar
  • 913
0 votes
1 answer
169 views

Generlized linear model after multiple imputation on survey data

I am using a generalized linear mixed model after multiple imputation on survey data. However, after performing the analysis, I cannot extract random effects and confidence intervals for the estimates....
Flad's user avatar
  • 1
3 votes
1 answer
248 views

Design-based standard errors in svyglm but w/o weights or stratification

For inverse probability weighting (IPW) in R the use of survey::svyglm is well established. I want to compare the results of 10K+ (!) models with and without the ...
jay.sf's user avatar
  • 878
0 votes
0 answers
36 views

Is it appropriate to pre-stratify and post-stratify along different delineations of the same variables in a single survey?

I am working in the context of opt-in, web-based surveys. Often the desire is for accurate population estimates, and often at a country-wide level. The standard approach at this organization is to ...
spathartic's user avatar

15 30 50 per page
1
2 3 4 5