Show simple item record

[journal article]

dc.contributor.authorSarracino, Francesco
dc.contributor.authorMikucka, Malgorzata
dc.date.accessioned2017-10-23T10:06:45Z
dc.date.available2017-10-23T10:06:45Z
dc.date.issued2017
dc.identifier.issn1864-3361
dc.identifier.urihttp://www.ssoar.info/ssoar/handle/document/54374
dc.description.abstract"Recent studies documented that survey data contain duplicate records. We assess how duplicate records affect regression estimates, and we evaluate the effectiveness of solutions to deal with duplicate records. Results show that the chances of obtaining unbiased estimates when data contain 40 doublets (about 5% of the sample) range between 3.5% and 11.5% depending on the distribution of duplicates. If 7 quintuplets are present in the data (2% of the sample), then the probability of obtaining biased estimates ranges between 11% and 20%. Weighting the duplicate records by the inverse of their multiplicity, or dropping superfluous duplicates outperform other solutions in all considered scenarios. Our results illustrate the risk of using data in presence of duplicate records and call for further research on strategies to analyze affected data." (author's abstract)en
dc.languageen
dc.subject.ddcSozialwissenschaften, Soziologiede
dc.subject.ddcSocial sciences, sociology, anthropologyen
dc.subject.otherduplicated observations; estimation bias; Monte Carlo simulation; inference
dc.titleBias and efficiency loss in regression estimates due to duplicated observations: a Monte Carlo simulation
dc.description.reviewbegutachtet (peer reviewed)de
dc.description.reviewpeer revieweden
dc.source.journalSurvey Research Methods
dc.source.volume11
dc.publisher.countryDEU
dc.source.issue1
dc.subject.classozErhebungstechniken und Analysetechniken der Sozialwissenschaftende
dc.subject.classozMethods and Techniques of Data Collection and Data Analysis, Statistical Methods, Computer Methodsen
dc.subject.thesozUmfrageforschungde
dc.subject.thesozsurvey researchen
dc.subject.thesozDatenqualitätde
dc.subject.thesozdata qualityen
dc.subject.thesozRegressionde
dc.subject.thesozregressionen
dc.subject.thesozSchätzungde
dc.subject.thesozestimationen
dc.rights.licenceDeposit Licence - Keine Weiterverbreitung, keine Bearbeitungde
dc.rights.licenceDeposit Licence - No Redistribution, No Modificationsen
internal.statusformal und inhaltlich fertig erschlossen
internal.identifier.thesoz10040714
internal.identifier.thesoz10055811
internal.identifier.thesoz10056459
internal.identifier.thesoz10057146
dc.type.stockarticle
dc.type.documentZeitschriftenartikelde
dc.type.documentjournal articleen
dc.source.pageinfo17-44
internal.identifier.classoz10105
internal.identifier.journal674
internal.identifier.document32
internal.identifier.ddc300
dc.identifier.doihttps://doi.org/10.18148/srm/2017.v11i1.7149
dc.description.pubstatusVeröffentlichungsversionde
dc.description.pubstatusPublished Versionen
internal.identifier.licence3
internal.identifier.pubstatus1
internal.identifier.review1
internal.check.abstractlanguageharmonizerCERTAIN


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record