So this is a conditional probability question that is giving me a really hard time:
This is a sequence, and its contingency table:
C C C C C B C C C C C C B A A A A A C C C C C A A C A A B A A A B C A A A A A A
C C A A C A A A A A A A A B B C C A C A A A B B C C B A A C C C C B A A A B B A
A B C B B B C B B C B B B C C B A C A A A A B A A B A A A A C B B B B B B B B B
B B B B C A B B C C B B B B A B B B B C C B B B B B C B C A A A C C C A B B A A
B B C C C A B B B B B A B B A A C A A A B B B C B B B B B C C B B C B B B A A A
A A A A A B B B C C C C C C B B B C A A A C C A C B B B B B B B C A A B B B C C
C C C C B A B B B C C C C C C A A C C C C C C C C C C B B B B B B B B C B A C B
B A A A C B A A C C A C C C C C B B B A
A2 B2 C2
A1 54 18 17 89
B1 18 71 23 112
C1 18 23 57 98
90 112 97 299
Explanation for table: Each sequence has 300 letters total, but because the contingency table is counting the number of times that a first observed letter is followed by a second observed letter, that means are counting pairs, so there are 299 pairs. To look up the number of times that we had a pair of letters where the first was an A (i.e., A1) and the second was B (i.e., B2), we look up the corresponding cell in the contingency table; for sequence 1, ‘AB’ occurred 32 times. The yellow margin to the right sums up the total times that we saw a letter appear as the first letter in a pair (irrespective of the second), and likewise the bottom yellow margin sums up the number of times the second letter appeared irrespective of the first
Question: 3a (.5 point): Using the table for the sequence, find the marginal probabilities P(X2 = A), P(X2 = B) and P(X2 = C).
P(X2 = A) = 90/299
P(X2 = B) = 112/299
P (X2 = C) = 97/299
3b (.5 point): Suppose you followed strategy 1 and always predicted X2 = B without ever looking at X1. Of the 299 opportunities to predict X2, how often would you be right if you predict X2 = B?
P(X2 = B) = 112/299
3c (.5 point): Using the table for sequence 2, find the conditional probabilities P(X2 = A | X1 = A), P(X2 = B | X1 = B) and P(X2 = C | X1 = C).
P(X2 = A | X1 = A) = P(X2 = A and X1 = A)/ P(X1= A) = 54/89
P(X2 = B | X1 = B) = P(X2 = B and X1 = B)/ P(X1= B) = 71/112
P(X2 = C | X1 = C) = P(X2 = C and X1 = C)/ P(X1= C) = 57/98
3d (.5 point): Suppose you followed strategy 2 and predicted that X2 will be the same as X1. Use the marginal frequencies of A, B and C for sequence 2, and the conditional probabilities you just calculated (in 3c) to compute how often you would be right.
This is where I'm stuck. I have no idea what marginal probabilities they are referring to, since I already used the marginal probabilities in computing the conditional ones. Even if I manage to find that out, I have no idea whether to multiply, add or divide it with the conditional ones.
Any help would really, really be appreciated!
Thank you!
Gender:
Points: 497
Reviews: 11