MaxRa comments on What are some low-information priors that you find practically useful for thinking about the world?

MaxRa 22 Sep 2020 9:59 UTC
2 points
I’m confused about the partition problem you linked to. Both examples in that post seem to be instances where in one partition available information is discarded.
Suppose you have a jar of blue, white, and black marbles, of unknown proportions. One is picked at random, and if it is blue, the light is turned on. If it is black or white, the light stays off (or is turned off). What is the probability the light is on?
There isn’t one single answer. In fact, there are several possible answers.
[1.] You might decide to assign a ¹⁄₂ probability to the light being on, because you’ve got no reason to assign any other odds. It’s either on (50%) or off (50%).
[2.] You could assign the blue marble a ¹⁄₃ probability of being selected (after all, you know that there are three colors). From this it would follow that you have a ¹⁄₃ chance of the light being on, and ²⁄₃ chance of the light being off.
Answer 1 seems to simply discard information about the algorithm that produces the result, i.e. that it depends on the color of the marbles. The same holds for the other example in the blog post, where the information about the number of possible planets is ignored in one partition.
- AidanGoth 22 Sep 2020 10:46 UTC
  2 points
  Yeah, these aren’t great examples because there’s a choice of partition which is better than the others. Thanks for pointing this out. The problem is more salient if you instead suppose that you have no information about how many different coloured marbles there are, and ask what the probability of picking a blue marble is. There are different ways of partitioning the possibilities but no obviously privileged partition. This is how Hilary Greaves frames it here.
  Another good example is van Fraassen’s cube factory, e.g. described here.
  - MaxRa 22 Sep 2020 11:51 UTC
    1 point
    Thanks a lot for the pointers! Greaves’ example seems to suffer the same problem, though, doesn’t it?
    Suppose, for instance, you know only that I am about to draw a book from my shelf, and that each book on my shelf has a single-coloured cover. Then POI seems to suggest that you are rationally required to have credence ½ that it will be red (Q1 = red, Q2 = not-red; and you have no evidence bearing on whether or not the book is red), but also that you are rationally required to have credence 1/n that it will be red, where n is the ‘number of possible colours’ (Qi = ith colour; and you have no evidence bearing on what colour the book is).
    We have information about the set and distribution of colors, and assigning 50% credence to the color red does not use that information.
    The cube factory problem does suffer less from this, cool!
    A factory produces cubes with side-length between 0 and 1 foot; what is the probability that a randomly chosen cube has side-length between 0 and ¹⁄₂ a foot? The classical interpretation’s answer is apparently ¹⁄₂, as we imagine a process of production that is uniformly distributed over side-length. But the question could have been given an equivalent restatement: A factory produces cubes with face-area between 0 and 1 square-feet; what is the probability that a randomly chosen cube has face-area between 0 and ¹⁄₄ square-feet? Now the answer is apparently ¹⁄₄, as we imagine a process of production that is uniformly distributed over face-area.
    I wonder if one should simply model this hierarchically, assigning equal credence to the idea that the relevant measure in cube production is side length or volume. For example, we might have information about cube bottle customers who want to fill their cubes with water. Because the customers vary in how much water they want to fit in their cube bottles, it seems to me that we should put more credence into partitioning according to volume. Or if we had some information that people often want to glue the cubes under their shoes to appear taller, the relevant measure would be the side length. Currently, we have no information like this, so we should assign equal credence to both measures.
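To make the numbers in the cube factory example concrete, here is a minimal sketch. It assumes that "uniform over a measure" means side length raised to some power (1 for length, 2 for face area, 3 for volume) is uniformly distributed on [0, 1]. The function name `prob_side_at_most` and the 50/50 weights are illustrative choices, not anything taken from the quoted sources.

```python
from fractions import Fraction


def prob_side_at_most(side: Fraction, power: int) -> Fraction:
    """P(side length <= side) when side**power is uniform on [0, 1].

    power=1: uniform over side length
    power=2: uniform over face area
    power=3: uniform over volume
    """
    return side ** power


half = Fraction(1, 2)

p_length = prob_side_at_most(half, 1)  # uniform over side length
p_area = prob_side_at_most(half, 2)    # uniform over face area
p_volume = prob_side_at_most(half, 3)  # uniform over volume

# Hierarchical version: equal credence (an assumption) that the relevant
# measure is side length or volume, then average the two answers.
weight = Fraction(1, 2)
p_mixture = weight * p_length + (1 - weight) * p_volume

print(p_length, p_area, p_volume, p_mixture)
```

Under these assumptions the three partitions give ¹⁄₂, ¹⁄₄ and ¹⁄₈ for the same cube, and the equal-weight mixture of length and volume gives ⁵⁄₁₆. The mixture still depends on which measures are put into the hierarchy, so it moves the choice up a level and does not remove it.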
    - AidanGoth 22 Sep 2020 18:55 UTC
      2 points
      I don’t think Greaves’ example suffers the same problem actually—if we truly don’t know anything about what the possible colours are (just that each book has one colour), then there’s no reason to prefer {red, yellow, blue, other} over {red, yellow, blue, green, other}.
      In the case of truly having no information, I think it makes sense to use the Jeffreys prior in the cube factory case because that’s invariant to reparameterisation, so it doesn’t matter whether the problem is framed in terms of length, area, volume, or some other parameterisation. I’m not sure what that actually looks like in this case, though.
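One possible reading, offered as a sketch only: a Jeffreys prior is formally derived from the Fisher information of a likelihood, and the cube factory puzzle does not specify one. If we assume side length L behaves like a scale parameter, the usual choice is the log-uniform density p(L) ∝ 1/L. The derivation below checks that this form survives the change of variables to face area A = L², and shows what happens on the interval from the puzzle. The cutoff ε is a hypothetical lower bound introduced here to make the density normalisable.

```latex
% Assumed prior on side length (scale-parameter form):
p_L(L) \propto \frac{1}{L}, \qquad 0 < L \le 1

% Change of variables to face area A = L^2, so L = \sqrt{A}:
p_A(A) = p_L(\sqrt{A}) \left| \frac{dL}{dA} \right|
       \propto \frac{1}{\sqrt{A}} \cdot \frac{1}{2\sqrt{A}}
       \propto \frac{1}{A}

% The density is not normalisable on (0, 1]. With a lower cutoff \varepsilon:
p_L(L) = \frac{1}{L \ln(1/\varepsilon)}, \qquad \varepsilon \le L \le 1

P\!\left(L \le \tfrac{1}{2}\right)
  = \frac{\ln\!\big(\tfrac{1}{2\varepsilon}\big)}{\ln(1/\varepsilon)}
  = 1 - \frac{\ln 2}{\ln(1/\varepsilon)}
```

The same value comes out if the calculation is done in terms of area with cutoff ε², which is the invariance in question. The answer does depend on ε, though, and tends to 1 as ε goes to 0, so under this assumption the prior removes the dependence on parameterisation but not the need for extra information.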
      - MaxRa 23 Sep 2020 19:54 UTC
        1 point
        Hm, but if we don’t know anything about the possible colours, the natural prior seems to me to be one that gives all colors the same probability. It seems arbitrary to group a subset of colors under the label “other” and treat it as a hypothesis on equal footing with the others in your given set, which are single colors.
        Yeah, Jeffreys prior seems to make sense here.