Tại sao


15

Tôi cho rằng

P ( A | B ) = P ( A | B , C ) * P ( C ) + P ( A | B , ¬ C ) * P ( ¬ C )

P(A|B)=P(A|B,C)P(C)+P(A|B,¬C)P(¬C)

là chính xác, trong khi

P ( A | B ) = P ( A | B , C ) + P ( A | B , ¬ C )

P(A|B)=P(A|B,C)+P(A|B,¬C)

không chính xác

Tuy nhiên, tôi đã có một "trực giác" về cái sau, nghĩa là bạn xem xét xác suất P (A | B) bằng cách tách hai trường hợp (C hoặc Không C). Tại sao trực giác này là sai?


4
Đây là một ví dụ đơn giản để kiểm tra phương trình của bạn. Toss hai đồng tiền độc lập, công bằng. Gọi AA là sự kiện mà người đầu tiên xuất hiện, BB là sự kiện mà người thứ hai xuất hiện và CC là sự kiện mà cả hai cùng đứng đầu. Là một trong những phương trình bạn đã viết đúng?
A. Rex

4
Định luật tổng xác suất cho biết nếu bạn muốn biểu thị xác suất vô điều kiện là tổng xác suất có điều kiện, bạn phải cân nhắc theo sự kiện điều hòa: ví dụ P ( A ) = P ( A | B ) P ( B ) + P ( A | ˉ B ) P ( ˉ B )P(A)=P(A|B)P(B)+P(A|B¯)P(B¯)
Adamo

Câu trả lời:


25

Giả sử, là một ví dụ truy cập dễ dàng, rằng xác suất P ( A )P(A) của AA11 , không phụ thuộc vào giá trị của CC . Sau đó, nếu chúng ta lấy phương trình không chính xác , chúng ta sẽ nhận được:

P ( A | B ) = P ( A | B , C ) + P ( A | B , ¬ C ) = 1 + 1 = 2P(A|B)=P(A|B,C)+P(A|B,¬C)=1+1=2

Điều đó rõ ràng là không thể chính xác, có lẽ không thể lớn hơn 1 . Điều này giúp xây dựng trực giác rằng bạn nên gán trọng số cho mỗi trong hai trường hợp tỷ lệ thuận với khả năng của trường hợp đó, dẫn đến phương trình (đúng) đầu tiên. .1


Điều đó mang bạn đến gần hơn với phương trình đầu tiên của bạn, nhưng các trọng số không hoàn toàn đúng. Xem bình luận của A. Rex 'cho các trọng số chính xác.


1
Should the weights in the "first (correct) equation" be P(C)P(C) and P(¬C)P(¬C), or should they be P(CB)P(CB) and P(¬CB)P(¬CB)?
A. Rex

@A.Rex That's a good point, for full correctness I think it should be P(C|B)P(C|B) and P(¬C|B)P(¬C|B). Everything (just a single term) on the left-hand side of the equation assumes that BB is given, so without any additional assumptions (like assuming that BB and CC are independent of each other), the same should be the case on the right-hand side
Dennis Soemers

Just think of A|B being 200% sure to happen.
Mark L. Stone

@MarkL.Stone Does that mean it always happens twice? ;)
Reinstate Monica

9

Dennis's answer has a great counter-example, disproving the wrong equation. This answer seeks to explain why the following equation is right:

P(A|B)=P(A|C,B)P(C|B)+P(A|¬C,B)P(¬C|B).

P(A|B)=P(A|C,B)P(C|B)+P(A|¬C,B)P(¬C|B).

As every term is conditioned on BB, we can replace the entire probability space by BB and drop the BB term. This gives us:

P(A)=P(A|C)P(C)+P(A|¬C)P(¬C).

P(A)=P(A|C)P(C)+P(A|¬C)P(¬C).

Then you are asking why this equation has the P(C)P(C) and P(¬C)P(¬C) terms in it.

The reason is that P(A|C)P(C)P(A|C)P(C) is the portion of AA in CC and P(A|¬C)P(¬C)P(A|¬C)P(¬C) is the portion of AA in ¬C¬C and the two add up to AA. See diagram. On the other hand P(A|C)P(A|C) is the proportion of CC containing AA and P(A|¬C)P(A|¬C) is the proportion of ¬C¬C containing AA - these are proportions of different regions so they don't have common denominators so adding them is meaningless.

pic


2
Not "everything is conditioned on BB". In particular, P(C)P(C) and P(¬C)P(¬C) are not, so you can't just drop BB. Moreover, this might suggest the equation is wrong!
A. Rex

@A.Rex Technically you're right, I should have said every term involving AA is conditioned on BB (I made a simple substitution A|BAA|BA). I will correct the answer.
Reinstate Monica

5
My objection wasn't a technicality. Your diagram correctly proves that P(A)=P(AC)P(C)+P(A¬C)P(¬C)P(A)=P(AC)P(C)+P(A¬C)P(¬C), which after conditioning on BB becomes P(AB)=P(AB,C)P(CB)+P(AB,¬C)P(¬CB)P(AB)=P(AB,C)P(CB)+P(AB,¬C)P(¬CB); note that the probabilities of CC and ¬C¬C are also conditioned on BB. This is not the first equation given in the OP, which is good news, because the first equation given in the OP is not correct.
A. Rex

@A.Rex You are right once again, CC must also conditioned on BB as the proportion of the probability space contained in CC might not be the same as the proportion of BB contained in CC. This point escaped me. I will revise again.
Reinstate Monica

7

I know you've already received two great answers to your question, but I just wanted to point out how you can turn the idea behind your intuition into the correct equation.

First, remember that P(XY)=P(XY)P(Y)P(XY)=P(XY)P(Y) and equivalently P(XY)=P(XY)P(Y)P(XY)=P(XY)P(Y).

To avoid making mistakes, we will use the first equation in the previous paragraph to eliminate all conditional probabilities, then keep rewriting expressions involving intersections and unions of events, then use the second equation in the previous paragraph to re-introduce the conditionals at the end. Thus, we start with: P(AB)=P(AB)P(B)

P(AB)=P(AB)P(B)

We will keep rewriting the right-hand side until we get the desired equation.

The casework in your intuition expands the event AA into (AC)(A¬C)(AC)(A¬C), resulting in P(AB)=P(((AC)(A¬C))B)P(B)

P(AB)=P(((AC)(A¬C))B)P(B)

As with sets, the intersection distributes over the union: P(AB)=P((ABC)(AB¬C))P(B)

P(AB)=P((ABC)(AB¬C))P(B)

Since the two events being unioned in the numerator are mutually exclusive (since CC and ¬C¬C cannot both happen), we can use the sum rule: P(AB)=P(ABC)P(B)+P(AB¬C)P(B)

P(AB)=P(ABC)P(B)+P(AB¬C)P(B)

We now see that P(AB)=P(ACB)+P(A¬CB)P(AB)=P(ACB)+P(A¬CB); thus, you can use the sum rule on the event on the event of interest (the "left" side of the conditional bar) if you keep the given event (the "right" side) the same. This can be used as a general rule for other equality proofs as well.

We re-introduce the desired conditionals using the second equation in the second paragraph: P(A(BC))=P(ABC)P(BC)

P(A(BC))=P(ABC)P(BC)
and similarly for ¬C¬C.

We plug this into our equation for P(AB)P(AB) as: P(AB)=P(ABC)P(BC)P(B)+P(AB¬C)P(B¬C)P(B)

P(AB)=P(ABC)P(BC)P(B)+P(AB¬C)P(B¬C)P(B)

Noting that P(BC)P(B)=P(CB)P(BC)P(B)=P(CB) (and similarly for ¬C¬C), we finally get P(AB)=P(ABC)P(CB)+P(AB¬C)P(¬CB)

P(AB)=P(ABC)P(CB)+P(AB¬C)P(¬CB)

Which is the correct equation (albeit with slightly different notation), including the fix A. Rex pointed out.

Note that P(ACB)P(ACB) turned into P(ABC)P(CB)P(ABC)P(CB). This mirrors the equation P(AC)=P(AC)P(C)P(AC)=P(AC)P(C) by adding the BB condition to not only P(AC)P(AC) and P(AC)P(AC), but also P(C)P(C) as well. I think if you are to use familiar rules on conditioned probabilities, you need to add the condition to all probabilities in the rule. And if there's any doubt whether that idea works for a particular situation, you can always expand out the conditionals to check, as I did for this answer.


2
+1. I think you extracted the equation that OP tried to intuit: P(AB)=P(ACB)+P(A¬CB)P(AB)=P(ACB)+P(A¬CB).
A. Rex

Thanks! That was the main point I wanted to make, but couldn't figure out a high-level explanation why the intersection goes on the left rather than the right, so I used formulas instead. Also, I just noticed you were the one who pointed out the mistake in OP's formula, so I credited you for that. (I probably wouldn't have noticed either, lol.)
YawarRaza7349

2

Probabilities are ratios; the probability of A given B is how often A happens within the space of B. For instance, P(rain|March)P(rain|March) is the number of rainy days in March divided by the number of total days in March. When dealing with fractions, it makes sense to split up numerators. For instance,

P(rain or snow|March)=(number of rainy or snowy days in March)(total number of days in March)=(number of rainy days in March)(total number of days in March)+(number of snowy days in March)(total number of days in March)=P(rain|March)+P(snow|March)

P(rain or snow|March)=(number of rainy or snowy days in March)(total number of days in March)=(number of rainy days in March)(total number of days in March)+(number of snowy days in March)(total number of days in March)=P(rain|March)+P(snow|March)

This of course assumes that "snow" and "rain" are mutually exclusive. It does not, however, make sense to split up denominators. So if you have P(rain|February or March)P(rain|February or March), that is equal to

(number of rainy days in February and March)(total number of days in February and March).

(number of rainy days in February and March)(total number of days in February and March).

But that is not equal to

(number of rainy days in February)(total number of days in February)+(number of rainy days in March)(total number of days in March).

(number of rainy days in February)(total number of days in February)+(number of rainy days in March)(total number of days in March).

If you're having trouble seeing that, you can try out some numbers. Suppose there are 10 rainy days in February and 8 in March. Then we have

(number of rainy days in February and March)(total number of days in February and March)=(10+8)/(28+31)=29.5%

(number of rainy days in February and March)(total number of days in February and March)=(10+8)/(28+31)=29.5%

and

(number of rainy days in February)(total number of days in February)+(number of rainy days in March)(total number of days in March)=(10/28)+(8/31)=35.7%+25.8%=61.5%

(number of rainy days in February)(total number of days in February)+(number of rainy days in March)(total number of days in March)=(10/28)+(8/31)=35.7%+25.8%=61.5%

The first number, 29.5%, is the average of 35.7% and 25.8% (with the second number weighted slightly more because there is are more days in March). When you say P(A|B)=P(A|B,C)+P(A|B,¬C)P(A|B)=P(A|B,C)+P(A|B,¬C) you're saying that x1+x2y1+y2=x1y1+x2y2, which is false.


1

If I go to Spain, I can get sunburnt. P(sunburnt|Spain)=0.2

This tells me nothing about getting sunburnt if not going to Spain, let's say P(sunburnt|¬Spain)=0.1
This year I'm going to Spain, so P(sunburnt)=0.2
Letting B=Ω, this is, P(B)=1, your intuition would imply P(A)=P(A|C)+P(A|¬C)
which by the previous argument, isn't neccesarily true.
Khi sử dụng trang web của chúng tôi, bạn xác nhận rằng bạn đã đọc và hiểu Chính sách cookieChính sách bảo mật của chúng tôi.
Licensed under cc by-sa 3.0 with attribution required.