1.
Mean profiles are widely used as indicators of the electricity consumption habits of customers. Currently, at Électricité de France (EDF), class load profiles are estimated using point‐wise mean profiles. Unfortunately, it is well known that the mean is highly sensitive to the presence of outliers, such as one or more consumers with unusually high levels of consumption. In this paper, we propose an alternative to the mean profile: the L1‐median profile, which is more robust. When dealing with large data sets of functional data (load curves, for example), survey sampling approaches are useful for estimating the median profile without storing the whole data set. We propose several sampling strategies and estimators for the median trajectory and compare them on a test population. We develop a stratification based on the linearized variable which substantially improves the accuracy of the estimator compared to simple random sampling without replacement. We also suggest an improved estimator that takes auxiliary information into account. Some potential areas for future research are also highlighted.
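To illustrate why the L1-median is less sensitive to outliers than the point-wise mean, here is a minimal sketch on toy data. It uses Weiszfeld iterations, which is an assumption made for illustration: the abstract does not say which algorithm the authors use, and the function name `l1_median` and the simulated "load curves" are hypothetical. No sampling weights are involved; this is the plain full-data case.

```python
import numpy as np

def l1_median(curves, n_iter=200, tol=1e-9):
    """Weiszfeld iterations for the L1-median (geometric median) of the rows."""
    curves = np.asarray(curves, dtype=float)
    m = curves.mean(axis=0)  # start from the point-wise mean profile
    for _ in range(n_iter):
        dist = np.linalg.norm(curves - m, axis=1)
        dist = np.maximum(dist, 1e-12)  # guard against division by zero
        w = 1.0 / dist                  # closer curves get larger weight
        m_new = (w[:, None] * curves).sum(axis=0) / w.sum()
        if np.linalg.norm(m_new - m) < tol:
            return m_new
        m = m_new
    return m

# Toy data (hypothetical): 50 flat curves near 1.0 plus one extreme consumer.
rng = np.random.default_rng(0)
curves = 1.0 + 0.05 * rng.standard_normal((50, 24))
curves = np.vstack([curves, np.full(24, 100.0)])

mean_profile = curves.mean(axis=0)    # pulled upward by the single outlier
median_profile = l1_median(curves)    # stays close to the bulk of the curves
```

In this toy setting the mean profile is dragged well above the typical level by one curve, while the L1-median stays near it.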
2.
We present a modern perspective of the conditional likelihood approach to the analysis of capture‐recapture experiments, which shows the conditional likelihood to be a member of the generalized linear model (GLM) family. Hence, there is the potential to apply the full range of GLM methodologies. To put this method in context, we first review some approaches to capture‐recapture experiments with heterogeneous capture probabilities in closed populations, covering parametric and non‐parametric mixture models and the use of covariates. We then review in more detail the analysis of capture‐recapture experiments when the capture probabilities depend on a covariate.
3.
When sampling a batch consisting of particulate material, the distribution of a sample estimator can be characterized using knowledge about the sample drawing process. With Bernoulli sampling, the number of particles in the sample is binomially distributed. Because this is rarely realized in practice, we propose a sampling design in which the possible samples have a nearly equal mass. Expected values and variances of the sample estimator are calculated. It is shown that the sample estimator becomes identical to the Horvitz–Thompson estimator in the case of a large batch-to-sample mass ratio and a large sample mass. Simulations and experiments were performed to test the theory. Simulations confirm that the round-off error due to the discrete nature of particles is negligible for large sample sizes. Sampling experiments were carried out with a mixture of polypropylene (PP) and polytetrafluoroethylene (PTFE) spheres suspended in a viscous medium. The measured and theoretical variations are in good agreement.
4.
This paper deals with model-based variance estimation for the Horvitz–Thompson (HT) estimator when auxiliary information is available. A small simulation study is carried out to illustrate and establish some of the findings.
5.
Calibration estimation, where the sampling weights are adjusted to make certain estimators match known population totals, is commonly used in survey sampling. The generalized regression estimator is an example of a calibration estimator. Given the functional form of the calibration adjustment term, we establish the asymptotic equivalence between the functional-form calibration estimator and an instrumental variable calibration estimator where the instrumental variable is directly determined from the functional form in the calibration equation. Variance estimation based on linearization is discussed and applied to some recently proposed calibration estimators. The results are extended to the estimator that is a solution to the calibrated estimating equation. Results from a limited simulation study are presented.
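The basic idea of calibration, adjusting design weights so that weighted sample totals of auxiliary variables reproduce known population totals, can be sketched with the standard linear (chi-square distance) calibration that underlies the generalized regression estimator. This is only an illustration of that textbook case, not of the functional-form or instrumental-variable estimators studied in the paper; the function name and the simulated population are hypothetical.

```python
import numpy as np

def linear_calibration_weights(d, X, totals):
    """Linear calibration: w_k = d_k * (1 + x_k' lam), with lam chosen so that
    the calibrated totals sum_k w_k x_k equal the known totals."""
    d = np.asarray(d, dtype=float)
    X = np.asarray(X, dtype=float)
    totals = np.asarray(totals, dtype=float)
    ht_totals = X.T @ d              # design-weighted estimates of the totals
    M = X.T @ (d[:, None] * X)       # sum_k d_k x_k x_k'
    lam = np.linalg.solve(M, totals - ht_totals)
    return d * (1.0 + X @ lam)

# Hypothetical population with an intercept and one auxiliary variable.
rng = np.random.default_rng(1)
N, n = 1000, 100
x_pop = rng.uniform(1.0, 10.0, size=N)
X_pop = np.column_stack([np.ones(N), x_pop])
totals = X_pop.sum(axis=0)           # known totals: N and the sum of x

# Simple random sample without replacement, design weights N / n.
sample = rng.choice(N, size=n, replace=False)
d = np.full(n, N / n)

w = linear_calibration_weights(d, X_pop[sample], totals)
calibrated_totals = X_pop[sample].T @ w   # matches `totals` up to rounding
```

Linear calibration can produce negative weights in unfavourable samples; bounded distance functions are a common alternative, at the cost of an iterative solution.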
6.
Maria del Mar Rueda, Antonio Arcos, David Molina, Maria Giovanna Ranalli 《Revue internationale de statistique》2018,86(1):51-67
Surveys usually include questions where individuals must select one in a series of possible options that can be sorted. On the other hand, multiple frame surveys are becoming a widely used method to decrease bias due to undercoverage of the target population. In this work, we propose statistical techniques for handling ordinal data coming from a multiple frame survey using complex sampling designs and auxiliary information. Our aim is to estimate proportions when the variable of interest has ordinal outcomes. Two estimators are constructed following model‐assisted generalised regression and model calibration techniques. Theoretical properties are investigated for these estimators. Simulation studies with different sampling procedures are considered to evaluate the performance of the proposed estimators in finite size samples. An application to a real survey on opinions towards immigration is also included.
7.
Postulating a super-population regression model connecting a size variable, a cheaply measurable variable and an expensively observable variable of interest, an asymptotically optimal double sampling strategy to estimate the survey population total of the third variable is specified. To render it practicable, unknown model-parameters in the optimal estimator are replaced by appropriate statistics. The resulting generalized regression estimator is then shown to have a model-cum-asymptotic design based expected square error equal to that of the asymptotically optimum estimator itself. An estimator for design variance of the estimator is also proposed.
8.
Postulating a linear regression of a variable of interest on an auxiliary variable with values of the latter known for all units of a survey population, we consider appropriate ways of choosing a sample and estimating the regression parameters. Recalling Thomsen’s (1978) results on non-existence of ‘design-cum-model’ based minimum variance unbiased estimators of regression coefficients, we apply Brewer’s (1979) ‘asymptotic’ analysis to derive ‘asymptotic-design-cum-model’ based optimal estimators assuming large population and sample sizes. A variance estimation procedure is also proposed.
9.
The effect of sampling people through households is considered. Results on design effects for two-stage surveys are reviewed and applied to give design effects of household samples. The main factors that determine the design effect are identified for the designs in which one person, or all people, are selected from each selected household. Within-household correlation is one factor. We show that the relationships between household size and the mean and variance within households are also important factors. Census and survey data are used to empirically compare the design effects for a range of estimators, variables and designs.
10.
In this paper, an alternative sampling procedure that is a mixture of simple random sampling and systematic sampling is proposed. It results in uniform inclusion probabilities for all individual units and positive inclusion probabilities for all pairs of units. As a result, the proposed sampling procedure enables us to estimate the population mean unbiasedly using the ordinary sample mean, and to provide an unbiased estimator of its sampling variance. It is also found that the suggested sampling procedure performs well, especially when the size of the simple random sample is small.
Received August 2001
11.
Variance estimation for unequal probability sampling (cited 1 time: 0 self-citations, 1 by others)
Guohua Zou 《Metrika》1999,50(1):71-82
In this paper, we discuss the optimality of the variance estimator of the Horvitz-Thompson estimator proposed by Kott (1988) in the class of model-unbiased quadratic estimators. We also propose some improved estimators over Kott's estimator in the class of general quadratic estimators.
Received: February 1999
12.
We study the generalized bootstrap technique under general sampling designs. We focus mainly on bootstrap variance estimation but we also investigate the empirical properties of bootstrap confidence intervals obtained using the percentile method. Generalized bootstrap consists of randomly generating bootstrap weights so that the first two (or more) design moments of the sampling error are tracked by the corresponding bootstrap moments. Most bootstrap methods in the literature can be viewed as special cases. We discuss issues such as the choice of the distribution used to generate bootstrap weights, the choice of the number of bootstrap replicates, and the potential occurrence of negative bootstrap weights. We first describe the generalized bootstrap for the linear Horvitz‐Thompson estimator and then consider non‐linear estimators such as those defined through estimating equations. We also develop two ways of bootstrapping the generalized regression estimator of a population total. We study in greater depth the case of Poisson sampling, which is often used to select samples in Price Index surveys conducted by national statistical agencies around the world. For Poisson sampling, we consider a pseudo‐population approach and show that the resulting bootstrap weights capture the first three design moments of the sampling error. A simulation study and an example with real survey data are used to illustrate the theory.
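The moment-matching idea can be sketched for the Horvitz-Thompson total under Poisson sampling. In the sketch below, each design weight is multiplied by an independent random adjustment with mean 1 and variance 1 − π_k, so that the bootstrap variance of the total matches the usual unbiased variance estimator for Poisson sampling. The normal distribution for the adjustments, the simulated population and all variable names are assumptions made for illustration; this is not the pseudo-population approach described in the abstract, and it tracks only the first two moments.

```python
import numpy as np

rng = np.random.default_rng(2)

# Hypothetical population and unequal inclusion probabilities.
N = 2000
y_pop = rng.gamma(shape=2.0, scale=50.0, size=N)
pi = rng.uniform(0.1, 0.5, size=N)

# Poisson sampling: each unit is selected independently with probability pi_k.
in_sample = rng.random(N) < pi
y, p = y_pop[in_sample], pi[in_sample]
w = 1.0 / p                                    # Horvitz-Thompson design weights

ht_total = np.sum(w * y)
# Standard unbiased variance estimator of the HT total under Poisson sampling.
v_analytic = np.sum((1.0 - p) * (w * y) ** 2)

# Bootstrap adjustments a_k with mean 1 and variance 1 - pi_k
# (normal adjustments are an assumption; they can yield negative weights).
B = 4000
a = 1.0 + np.sqrt(1.0 - p) * rng.standard_normal((B, y.size))
boot_totals = (a * (w * y)).sum(axis=1)        # one bootstrap total per replicate
v_boot = boot_totals.var(ddof=1)               # approximates v_analytic
```

With normal adjustments some bootstrap weights `a * w` will be negative, which is one of the practical issues the abstract mentions; a distribution supported on non-negative values would avoid that.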
13.
Small area estimation typically requires model‐based methods that depend on isolating the contribution to overall population heterogeneity associated with group (i.e. small area) membership. One way of doing this is via random effects models with latent group effects. Alternatively, one can use an M‐quantile ensemble model that assigns indices to sampled individuals characterising their contribution to overall sample heterogeneity. These indices are then aggregated to form group effects. The aim of this article is to contrast these two approaches to characterising group effects and to illustrate them in the context of small area estimation. In doing so, we consider a range of different data types, including continuous data, count data and binary response data.
14.
Andrius Čiginas 《Statistica Neerlandica》2020,74(2):145-158
We consider methods for estimating the means of survey variables in domains of a finite population, where sample sizes are too small to obtain reliable direct estimates. We construct generalized compositions from the direct and traditional design-based synthetic estimators and propose a methodology for evaluating their coefficients. This methodology measures similarities among sample elements and estimates of the domain means. We propose the compositions for two cases of auxiliary information: (i) domain-level characteristics are available; (ii) true means of auxiliary variables are available for the estimation domains, and unit-level auxiliary vectors are known for the sample elements. In the simulation study, we show where the generalized compositions improve on the traditional synthetic and composite estimators.
15.
This article is concerned with inference on seemingly unrelated non‐parametric regression models with serially correlated errors. Based on an initial estimator of the mean functions, we first construct an efficient estimator of the autoregressive parameters of the errors. Then, by applying an undersmoothing technique, and taking both the contemporaneous correlation among equations and the serial correlation into account, we propose an efficient two‐stage local polynomial estimator for the unknown mean functions. It is shown that the resulting estimator has the same bias as those estimators which neglect the contemporaneous and/or serial correlation, and a smaller asymptotic variance. The asymptotic normality of the resulting estimator is also established. In addition, we develop a wild block bootstrap test for the goodness‐of‐fit of models. The finite sample performance of our procedures is investigated in a simulation study, whose results are supportive, and a real data set is analysed to illustrate the usefulness of our procedures.
16.
《Revue internationale de statistique》2017,85(2):250-269
The use of auxiliary variables to improve the efficiency of estimators is a well‐known strategy in survey sampling. Typically, the auxiliary variables used are the totals of appropriate measurements that are exactly known from registers or administrative sources. Increasingly, however, these totals are estimated from surveys and are then used to calibrate estimators and improve their efficiency. We consider different types of survey structures and develop design‐based estimators that are calibrated on known as well as estimated totals of auxiliary variables. The optimality properties of these estimators are studied. These estimators can be viewed as extensions of the Montanari generalised regression estimator adapted to more complex situations. The paper studies interesting special cases to develop insights and guidelines for properly managing the survey‐estimated auxiliary totals.
17.
Dr. H. Milbrodt 《Metrika》1987,34(1):275-281
Summary: Supplementing results of Hájek (1960, 1964), sufficient conditions for Poisson convergence of the Horvitz-Thompson estimator in pps rejective sampling are obtained.
18.
Graham Kalton 《Revue internationale de statistique》2019,87(Z1):S10-S30
Many developments have occurred in the practice of survey sampling and survey methodology in the past 60 years or so. These developments have been partly driven by the emergence of computers and the continuous growth in computer power over the years and partly by the increasingly sophisticated demands from the users of survey data. The paper reviews these developments with a main emphasis on survey sampling issues for the design and analysis of social surveys. Design‐based inference based on probability samples was the predominant approach in the early years, but over time, that predominance has been eroded by the need to employ model‐dependent methods to deal with missing data and to satisfy analysts' demands for survey estimates that cannot be met with design‐based methods. With the continuous decline in response rates that has occurred in recent years, much current research has focused on the use of non‐probability samples and data collected from administrative records and web surveys.
19.
Jeffrey S. Simonoff 《Revue internationale de statistique》1998,66(2):137-156
The past forty years have seen a great deal of research into the construction and properties of nonparametric estimates of smooth functions. This research has focused primarily on two sides of the smoothing problem: nonparametric regression and density estimation. Theoretical results for these two situations are similar, and multivariate density estimation was an early justification for the Nadaraya-Watson kernel regression estimator.
A third, less well-explored, strand of applications of smoothing is to the estimation of probabilities in categorical data. In this paper the position of categorical data smoothing as a bridge between nonparametric regression and density estimation is explored. Nonparametric regression provides a paradigm for the construction of effective categorical smoothing estimates, and use of an appropriate likelihood function yields cell probability estimates with many desirable properties. Such estimates can be used to construct regression estimates when one or more of the categorical variables are viewed as response variables. They also lead naturally to the construction of well-behaved density estimates using local or penalized likelihood estimation, which can then be used in a regression context. Several real data sets are used to illustrate these points.
20.
Raghunath Arnab 《Metrika》2001,54(2):159-177
The problems of estimating a population total in multi-character surveys are considered in a unified setup. Alternative estimators for Rao-Hartley-Cochran (1962), Midzuno-Sen (1952, 53) and other varying probability sampling schemes are proposed when the measure of size is not well related to the study variables. Some of the proposed estimators are found superior to the existing alternatives. A numerical study is carried out to investigate the performance of the proposed alternatives.