رگرسیون پواسون

در آمار، رگرسیون پواسون نوعی از تحلیل رگرسیون و زیرمجموعه‌ای از مدل‌های خطی تعمیم‌یافته است که برای تحلیل داده‌های حاصل از شمارش به کار می‌رود.اگر $displaystyle mathbf x in mathbb R ^n$ برداری از متغیر وابسته و مستقل باشد، فرم زیر را می‌گیرد:

displaystyle log(operatorname E (Y

که در آن $displaystyle mathbf a in mathbb R ^n$ و $displaystyle bin mathbb R$ . می‌توان فرم بالا را به این صورت نیز نوشت:

displaystyle log(operatorname E (Y

که در آن x بردار ( $displaystyle mathbf n+1$ )-بعدی از متغیرهاست. با داشتن پارامتر رگرسیون پواسون $displaystyle mathbf theta$ و بردار ورودی $mathbf x$ ، می‌توان پیش‌بینی را به اینصورت بدست آورد:

mathbf x )=e^left(boldsymbol theta 'mathbf x right).,

محتویات

۱ تخمین پارامترها بر اساس بیشینه درست نمایی

۲ ده سازی‌ها

۳ جستارهای وابسته

۴ منابع

تخمین پارامترها بر اساس بیشینه درست نمایی[ویرایش]

بردار متغیر وابسته $x$ است و $theta$ پارامتر مدل رگرسیون پوسان است، $Y$ متغیر مستقل است که آنرا با یک توزیع پوسان شبیه سازی میکنیم که میانگین آن در معادله پایین آمده است ^[۱]:

$displaystyle lambda :=operatorname E (Ymid x)=e^theta 'x,,$

از این رو تابع احتمال این توزیع برابر است با:

$displaystyle p(ymid x;theta )=frac lambda ^yy!e^-lambda =frac e^ytheta 'xe^-e^theta 'xy!$

حال اگر فرض کنیم که $m$ داده داریم یعنی $displaystyle (x_1,y_1),cdots ,(x_m,y_m)$ و مقادیر متغیر مستقل از مجموعه اعداد طبیعی می‌آید یعنی $displaystyle y_1,ldots ,y_min mathbb N$ و متغیرهای وابسته $displaystyle n+1$ هستند یعنی $displaystyle x_iin mathbb R ^n+1,,i=1,ldots ,m$ آنگاه احتمال متغیرهای مستقل به شرط مشاهده متغیرهای وابسته برابر خواهد شد با:‌

$displaystyle p(y_1,ldots ,y_mmid x_1,ldots ,x_m;theta )=prod _i=1^mfrac e^y_itheta 'x_ie^-e^theta 'x_iy_i!.$

حال بر حسب اصل بیشینه‌سازی درست نمایی باید به دنبال پارامتری بگردیم که این درست نمایی به بیشترین مقدار خود برسد، یعنی تابع پایین بیشینه شود: $displaystyle L(theta mid X,Y)=prod _i=1^mfrac e^y_itheta 'x_ie^-e^theta 'x_iy_i!.$

از آنجا که تابع لگاریتم مطلقاً صعودی است بجای بیشینه کردن تابع درست نمایی می‌توان لگاریتم آن را بیشینه کرد که تابع را ساده‌تر می‌کند. به عبارتی دیگر همان پارامتری که لگاریتم تابع درست نمایی را بیشینه می‌کند، همان پارامتر، خودِ تابع درست نمایی را نیز بیشنه می‌کند. لگاریتم تابع با معادله پایین برابر خواهد شد:

$displaystyle ell (theta mid X,Y)=log L(theta mid X,Y)=sum _i=1^mleft(y_itheta 'x_i-e^theta 'x_i-log(y_i!)right).$

از آنجا که $displaystyle sum _i=1^mlog(y_i!)$ ثابت است و پارامتر $displaystyle theta$ را در خود ندارد می‌توان آنرا از تابع حذف کرد و به تابع پایین رسید^[۱]

$displaystyle ell (theta mid X,Y)=sum _i=1^mleft(y_itheta 'x_i-e^theta 'x_iright).$

حال برای پیدا کردن بیشینه تابعِ $displaystyle ell (theta mid X,Y)$ باید گرادیان آنرا با صفر یکی کرد، یعنی $displaystyle frac partial ell (theta mid X,Y)partial theta =0$ . این معادله اما جوابی در فرم بسته ندارد و باید جواب آنرا از روشی دیگر پیدا کرد. از آنجا که $displaystyle -ell (theta mid X,Y)$ تابعی محّدب است، می‌توان به پارامتر بهینه یعنی پارامتری که $displaystyle -ell (theta mid X,Y)$ را کمینه و $displaystyle ell (theta mid X,Y)$ را بیشینه کند با روشهای بهینه‌سازی محّدب مانند گرادیان کاهشی رسید.

ده سازی‌ها[ویرایش]

Some statistics packages include implementations of Poisson regression.

متلب Statistics Toolbox: Poisson regression can be performed using the "glmfit" and "glmval" functions.^[۲]

مایکروسافت اکسل: Excel is not capable of doing Poisson regression by default. One of the Excel Add-ins for Poisson regression is XPost

آر (زبان برنامه‌نویسی): The function for fitting a generalized linear model in R is glm(), and can be used for Poisson Regression

ساس (نرم‌افزار): Poisson regression in SAS is done by using GENMOD

اس‌پی‌اس‌اس: In SPSS, Poisson regression is done by using the GENLIN command

Stata: Stata has a procedure for Poisson regression named "poisson"

mPlus: mPlus allows for Poisson regression using the command COUNT IS when specifying the data

جستارهای وابسته[ویرایش]

توزیع پواسون

رگرسیون خطی

رگرسیون لجیستیک

منابع[ویرایش]

Cameron, A.C. and P.K. Trivedi (1998). Regression analysis of count data, Cambridge University Press. ISBN 0-521-63201-3

Christensen، Ronald (۱۹۹۷). Log-linear models and logistic regression. Springer Texts in Statistics (ویراست Second). New York: Springer-Verlag. صص. xvi+۴۸۳. MR 1633357. شابک ۰-۳۸۷-۹۸۲۴۷-۷..mw-parser-output cite.citationfont-style:inherit.mw-parser-output qquotes:"""""""'""'".mw-parser-output code.cs1-codecolor:inherit;background:inherit;border:inherit;padding:inherit.mw-parser-output .cs1-lock-free abackground:url("//upload.wikimedia.org/wikipedia/commons/thumb/6/65/Lock-green.svg/9px-Lock-green.svg.png")no-repeat;background-position:right .1em center;padding-right:1em;padding-left:0.mw-parser-output .cs1-lock-limited a,.mw-parser-output .cs1-lock-registration abackground:url("//upload.wikimedia.org/wikipedia/commons/thumb/d/d6/Lock-gray-alt-2.svg/9px-Lock-gray-alt-2.svg.png")no-repeat;background-position:right .1em center;padding-right:1em;padding-left:0.mw-parser-output .cs1-lock-subscription abackground:url("//upload.wikimedia.org/wikipedia/commons/thumb/a/aa/Lock-red-alt-2.svg/9px-Lock-red-alt-2.svg.png")no-repeat;background-position:right .1em center;padding-right:1em;padding-left:0.mw-parser-output div[dir=ltr] .cs1-lock-subscription a,.mw-parser-output div[dir=ltr] .cs1-lock-limited a,.mw-parser-output div[dir=ltr] .cs1-lock-registration abackground-position:left .1em center;padding-left:1em;padding-right:0.mw-parser-output .cs1-subscription,.mw-parser-output .cs1-registrationcolor:#555.mw-parser-output .cs1-subscription span,.mw-parser-output .cs1-registration spanborder-bottom:1px dotted;cursor:help.mw-parser-output .cs1-hidden-errordisplay:none;font-size:100%.mw-parser-output .cs1-visible-errorfont-size:100%.mw-parser-output .cs1-subscription,.mw-parser-output .cs1-registration,.mw-parser-output .cs1-formatfont-size:95%.mw-parser-output .cs1-kern-left,.mw-parser-output .cs1-kern-wl-leftpadding-left:0.2em.mw-parser-output .cs1-kern-right,.mw-parser-output .cs1-kern-wl-rightpadding-right:0.2em

Hilbe, J. M. (2007). Negative Binomial Regression, Cambridge University Press. ISBN 978-0-521-85772-7

↑ ^۱٫۰^۱٫۱ MacDonald, John M.; Berk, Richard (2008-09-01). "Overdispersion and Poisson Regression". Journal of Quantitative Criminology. 24 (3): 269–284. doi:10.1007/s10940-008-9048-4. ISSN 1573-7799..mw-parser-output cite.citationfont-style:inherit.mw-parser-output qquotes:"""""""'""'".mw-parser-output code.cs1-codecolor:inherit;background:inherit;border:inherit;padding:inherit.mw-parser-output .cs1-lock-free abackground:url("//upload.wikimedia.org/wikipedia/commons/thumb/6/65/Lock-green.svg/9px-Lock-green.svg.png")no-repeat;background-position:right .1em center.mw-parser-output .cs1-lock-limited a,.mw-parser-output .cs1-lock-registration abackground:url("//upload.wikimedia.org/wikipedia/commons/thumb/d/d6/Lock-gray-alt-2.svg/9px-Lock-gray-alt-2.svg.png")no-repeat;background-position:right .1em center.mw-parser-output .cs1-lock-subscription abackground:url("//upload.wikimedia.org/wikipedia/commons/thumb/a/aa/Lock-red-alt-2.svg/9px-Lock-red-alt-2.svg.png")no-repeat;background-position:right .1em center.mw-parser-output div[dir=ltr] .cs1-lock-subscription a,.mw-parser-output div[dir=ltr] .cs1-lock-limited a,.mw-parser-output div[dir=ltr] .cs1-lock-registration abackground-position:left .1em center.mw-parser-output .cs1-subscription,.mw-parser-output .cs1-registrationcolor:#555.mw-parser-output .cs1-subscription span,.mw-parser-output .cs1-registration spanborder-bottom:1px dotted;cursor:help.mw-parser-output .cs1-hidden-errordisplay:none;font-size:100%.mw-parser-output .cs1-visible-errorfont-size:100%.mw-parser-output .cs1-subscription,.mw-parser-output .cs1-registration,.mw-parser-output .cs1-formatfont-size:95%.mw-parser-output .cs1-kern-left,.mw-parser-output .cs1-kern-wl-leftpadding-left:0.2em.mw-parser-output .cs1-kern-right,.mw-parser-output .cs1-kern-wl-rightpadding-right:0.2em

↑ http://www.mathworks.com/help/toolbox/stats/glmfit.html

[:0-1] ۱٫۰^۱٫۱ MacDonald, John M.; Berk, Richard (2008-09-01). "Overdispersion and Poisson Regression". Journal of Quantitative Criminology. 24 (3): 269–284. doi:10.1007/s10940-008-9048-4. ISSN 1573-7799..mw-parser-output cite.citationfont-style:inherit.mw-parser-output qquotes:"""""""'""'".mw-parser-output code.cs1-codecolor:inherit;background:inherit;border:inherit;padding:inherit.mw-parser-output .cs1-lock-free abackground:url("//upload.wikimedia.org/wikipedia/commons/thumb/6/65/Lock-green.svg/9px-Lock-green.svg.png")no-repeat;background-position:right .1em center.mw-parser-output .cs1-lock-limited a,.mw-parser-output .cs1-lock-registration abackground:url("//upload.wikimedia.org/wikipedia/commons/thumb/d/d6/Lock-gray-alt-2.svg/9px-Lock-gray-alt-2.svg.png")no-repeat;background-position:right .1em center.mw-parser-output .cs1-lock-subscription abackground:url("//upload.wikimedia.org/wikipedia/commons/thumb/a/aa/Lock-red-alt-2.svg/9px-Lock-red-alt-2.svg.png")no-repeat;background-position:right .1em center.mw-parser-output div[dir=ltr] .cs1-lock-subscription a,.mw-parser-output div[dir=ltr] .cs1-lock-limited a,.mw-parser-output div[dir=ltr] .cs1-lock-registration abackground-position:left .1em center.mw-parser-output .cs1-subscription,.mw-parser-output .cs1-registrationcolor:#555.mw-parser-output .cs1-subscription span,.mw-parser-output .cs1-registration spanborder-bottom:1px dotted;cursor:help.mw-parser-output .cs1-hidden-errordisplay:none;font-size:100%.mw-parser-output .cs1-visible-errorfont-size:100%.mw-parser-output .cs1-subscription,.mw-parser-output .cs1-registration,.mw-parser-output .cs1-formatfont-size:95%.mw-parser-output .cs1-kern-left,.mw-parser-output .cs1-kern-wl-leftpadding-left:0.2em.mw-parser-output .cs1-kern-right,.mw-parser-output .cs1-kern-wl-rightpadding-right:0.2em

[2] ttp://www.mathworks.com/help/toolbox/stats/glmfit.html

ن ب و پژوهش‌های اجتماعی
جمع آوری داده‌ها	روش‌های گردآوری سرشماری نمونه گیری در پژوهش نمونه گیری تصادفی پرسشنامه مصاحبه ساختارمند نیمه-ساختاریافته بدون چارچوب
تحلیل داده‌ها	داده‌های رسته‌ای جدول پیشایندی سطوح سنجش آمار توصیفی تحلیل اکتشافی داده‌ها آمار چندمتغیره روان‌سنجی آمار استنباطی مدل آماری گرافی رگرسیون پواسون ساختاری
کاربردها	تحقیق بازار جمعیت‌شناسی نظرسنجی رهگیری نظرسنجی‌ها افکار عمومی
پژوهش‌های عمده	مطالعات انتخابات ملی آمریکا ‏(en)‏ گالوپ (شرکت) پژوهش اجتماعی عمومی ‏(en)‏ برنامه پژوهش اجتماعی بین المللی ‏(en)‏ سرشماری در ایران سرشماری در بریتانیا ‏(en)‏ سرشماری در ایالات متحده ‏(en)‏ پژوهش آزمون بهداشت، درمان و تغذیه ملی آمریکا ‏(en)‏ مطالعهٔ نگرشها و ارزشهای نیوزیلند ‏(en)‏ پژوهش ارزشهای جهانی ‏(en)‏
موسسات تخصصی	موسسه بین المللی آمار ‏(en)‏ انجمن جهانی نظرسنجی عمومی ‏(en)‏ انجمن آمریکایی نظرسنجی عمومی ‏(en)‏ انجمن اروپایی نظرسنجی و تحقیقات بازاریابی ‏(en)‏ مرکز تحقیقات پیو

搜尋此網誌

Dfrnhjy

رگرسیون پواسون

محتویات

تخمین پارامترها بر اساس بیشینه درست نمایی[ویرایش]

ده سازی‌ها[ویرایش]

جستارهای وابسته[ویرایش]

منابع[ویرایش]

منوی ناوبری

ابزارهای شخصی

فضاهای نام

گویش‌ها

بازدیدها

بیشتر

جستجو

بازدید محتوا

همکاری

نسخه‌برداری

ابزارها

به زبان‌های دیگر

Popular posts from this blog