Sự khác nhau giữa mô hình thống kê và mô hình xác suất?

Xác suất áp dụng là một nhánh quan trọng trong xác suất, bao gồm xác suất tính toán. Vì thống kê đang sử dụng lý thuyết xác suất để xây dựng các mô hình để xử lý dữ liệu, theo hiểu biết của tôi, tôi tự hỏi sự khác biệt cơ bản giữa mô hình thống kê và mô hình xác suất là gì? Mô hình xác suất không cần dữ liệu thực? Cảm ơn.

probability mathematical-statistics

— Hồng Vương
nguồn

Một xác suất mẫu bao gồm bộ ba $(\Omega,{\mathcal F},{\mathbb P})$ , nơi $\Omega$ là không gian mẫu, ${\mathcal F}$ là một $\sigma$ -algebra (sự kiện) và ${\mathbb P}$ là một thước đo khả năng trên ${\mathcal F}$ .

Giải thích trực quan . Một mô hình xác suất có thể được hiểu như là một tiếng biến ngẫu nhiên . Ví dụ: Đặt là biến ngẫu nhiên phân phối thông thường với giá trị trung bình và phương sai . Trong trường hợp này, thước đo xác suất được liên kết với Hàm phân phối tích lũy (CDF) thông qua $X$ $X$ $0$ $1$ ${\mathbb P}$ $F$

F (x) = P (X \leq x) = P (ω \in Ω : X (ω) \leq x) = \int_{- \infty}^{x} \frac{1}{\sqrt{2 π}} \exp (- \frac{t^{2}}{2}) d t .

$F(x)={\mathbb P}(X\leq x) = {\mathbb P}(\omega\in\Omega:X(\omega)\leq x) =\int_{-\infty}^x \dfrac{1}{\sqrt{2\pi}}\exp\left({-\dfrac{t^2}{2}}\right)dt.$

Khái quát hóa . Định nghĩa của Mô hình Xác suất phụ thuộc vào định nghĩa toán học của xác suất, xem ví dụ Xác suất miễn phí và Xác suất lượng tử .

Một mẫu thống kê là một bộ của các mô hình xác suất, đây là, một tập hợp các biện pháp xác / phân phối trên không gian mẫu . ${\mathcal S}$ $\Omega$

Tập phân phối xác suất này thường được chọn để mô hình hóa một hiện tượng nhất định mà chúng tôi có dữ liệu.

Giải thích trực quan . Trong Mô hình thống kê, cả hai tham số và phân phối mô tả một hiện tượng nhất định đều không xác định. Một ví dụ của việc này là familiy của phân phối chuẩn với trung bình và phương sai , đây là, cả hai thông số chưa được biết và bạn thường muốn sử dụng tập dữ liệu cho việc ước tính các thông số (ví dụ: chọn một phần tử của ). Điều này đặt các bản phân phối có thể được lựa chọn vào bất kỳ và , nhưng, nếu tôi không nhầm, trong một ví dụ thực tế chỉ có những người được xác định trên cùng một cặp $\mu\in{\mathbb R}$ $\sigma^2\in{\mathbb R_+}$ ${\mathcal S}$ $\Omega$ ${\mathcal F}$ $(\Omega,{\mathcal F})$ là hợp lý để xem xét.

Generalisations. This paper provides a very formal definition of Statistical Model, but the author mentions that "Bayesian model requires an additional component in the form of a prior distribution ... Although Bayesian formulations are not the primary focus of this paper". Therefore the definition of Statistical Model depend on the kind of model we use: parametric or nonparametric. Also in the parametric setting, the definition depends on how parameters are treated (e.g. Classical vs. Bayesian).

$\mbox{Normal}(\mu_0,\sigma_0^2)$ $\mu_0,\sigma_0^2$ $\mbox{Normal}(\mu,\sigma^2)$ , where $\mu,\sigma^2$ are unknown parameters.

None of them require a data set, but I would say that a Statistical model is usually selected for modelling one.

— Xi'an
nguồn

@HonglangWang That is correct to some extent. The main difference is that a probability model is only one (known) distribution, while a statistical model is a set of probability models; the data is used to select a model from this set or a smaller subset of models that better (in a certain sense) describe the phenomenon (in the light of the data).

(+1) This is a nice answer, though I have a couple of comments. First, I think this may be selling the probabilist a little bit short. It is not at all uncommon to consider a set of probability spaces in a probabilistic model, and indeed, the possible measures can even be random (constructed on a suitably larger space). Second, a Bayesian (in particular) might find this answer slightly disconcerting in that a Bayesian statistical model can often be viewed as a single probability model on a suitable product space

Ω \times Θ

$\Omega \times \Theta$ .

— cardinal

@gung This a more measure-theory-related question. Regarding your first question,

P

${\mathbb P}$ is indeed defined through the CDF. Now, the interpretation of

Ω

$\Omega$ is the difficult one because, formally,

P (X \leq x)

${\mathbb P}(X\leq x)$ means

P (ω \in Ω : X (ω) \leq x)

${\mathbb P}(\omega\in\Omega: X(\omega)\leq x)$ , then

Ω

$\Omega$ are not observable values.

F

${\mathcal F}$ is a

σ -

$\sigma-$ algebra which is the pre-image of the Borel

σ -

$\sigma-$ algebra under

X

$X$ , again this are not observable. I am not sure how to explain this in an intuitive level.

@gung

Ω

$\Omega$ depends on the application; it is not determined by theory. For instance,

Ω

$\Omega$ could be a set of Brownian motions describing the price of a financial derivative and

X

$X$ could be the value attained at a fixed time

t

$t$ . In another application

Ω

$\Omega$ could be a set of people and

X

$X$ could be the lengths of their forearms. Generally,

Ω

$\Omega$ is a mathematical model of the physical objects of study and

X

$X$ is a numerical property of those objects.

F

$\mathcal{F}$ is the set of possible events: those situations to which we want to ascribe probabilities.

— whuber

@gung

F

$\mathcal{F}$ is a sigma algebra: it's a collection of subsets (the "events"). In the financial application, it's a set of price histories; in the forearm measurements application, the events would be sets of people. We can talk about this more if you want in a chat room.

— whuber