Postdoc in statistics. Three kids, two cats, one wife. I write about statistics, EA, psychometrics, and other things at my blog

# Jonas Moss

I don’t understand your notion of context here. I’m understanding pairwise comparisons as standard decision theory: you are comparing the expected values of two lotteries, nothing more. Is the context about psychology somehow? If so, that might be interesting, but it adds a layer of complexity this sort of methodology cannot be expected to handle.

Players may have different utility functions, but it might be reasonable to ignore that when modelling all of this. In any case, every intervention will have its own unique expected utility for each player. (This is ignoring noise in the estimates, but that is pretty easy to handle.)

Estimation is actually pretty easy (using linear regression) and has essentially been a solved problem since 1952: Scheffé, H. (1952). An Analysis of Variance for Paired Comparisons. Journal of the American Statistical Association, 47(259), 381–400. https://doi.org/10.1080/01621459.1952.10501179
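As a sketch of how the linear-regression estimation goes (the comparison scores are hypothetical, and the +1/−1 design below is the standard paired-comparison linear model rather than Scheffé’s exact formulation):

```python
import numpy as np

# Hypothetical paired-comparison data: (i, j, y) means item i was compared
# to item j and received graded score y (preference for i over j).
comparisons = [(0, 1, 1.2), (0, 2, 2.1), (1, 2, 0.8), (2, 0, -1.9), (1, 0, -1.1)]
n_items = 3

# Design matrix: the row for comparison (i, j) has +1 in column i and -1 in
# column j, so the model is y = theta_i - theta_j + noise.
X = np.zeros((len(comparisons), n_items))
y = np.zeros(len(comparisons))
for row, (i, j, score) in enumerate(comparisons):
    X[row, i], X[row, j] = 1.0, -1.0
    y[row] = score

# The thetas are only identified up to an additive constant, so pin
# theta_0 = 0 by dropping the first column before solving.
theta_rest, *_ = np.linalg.lstsq(X[:, 1:], y, rcond=None)
theta = np.concatenate([[0.0], theta_rest])
print(theta)  # estimated item scores relative to item 0
```

Ordinary least squares is all that is needed; noise in the comparisons just becomes the regression error term.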

I wrote about the methodology (before finding Scheffé’s paper) here.

Do I understand you correctly here?

Each agent has a computable partial preference ordering that decides if it prefers x to y.

We’d like this partial relation to be complete (i.e., defined for all pairs x, y) and transitive (i.e., x ⪯ y and y ⪯ z implies x ⪯ z).

Now, if the relation is sufficiently non-trivial, it will be expensive to compute for some pairs x, y. So it’s better left undefined...?

If so, I can surely relate to that, as I often struggle to compute my preferences, even if they are theoretically complete. But it seems to me the relation is still defined; it just might not be practical to compute.

It’s also possible to think of it this way: you start out with a partial preference ordering and need to compute one of its transitive, complete extensions. But that is computationally difficult, and the extension is not unique either.
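For what it’s worth, the transitive-closure step itself is cheap once the base relation is written down; the expensive part is evaluating the relation, and the non-uniqueness enters when extending the closure to a complete ordering. A minimal sketch with a hypothetical three-element relation:

```python
# Transitive closure of a (hypothetical) partial preference relation,
# computed with Warshall's algorithm over explicit pairs.
def transitive_closure(pairs, elements):
    reach = set(pairs)
    for k in elements:
        for a in elements:
            for b in elements:
                if (a, k) in reach and (k, b) in reach:
                    reach.add((a, b))
    return reach

# "prefers a to b" and "prefers b to c" force "prefers a to c".
closure = transitive_closure({("a", "b"), ("b", "c")}, ["a", "b", "c"])
print(("a", "c") in closure)
```

The closure is forced; choosing how to rank pairs the relation says nothing about is where the arbitrariness comes in.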

I’m unsure what these observations add to the discussion, though.

Some comments:

Have you considered hiring a designer for this document? It doesn’t look good at all and is filled with boldface all over the place.

Why is it so long? I don’t see why it’s important for vegans to know that cows are supplemented with vitamin B12.

It could have benefited a lot from lists of key takeaways. For instance, do you need to take vitamin D3 supplementation, and how much? Much of the document feels like an info dump to me.

Sure, if your goal is to be a good writer! But, I’m not worried about that. I just want people to understand me.


As far as I can recall, my paragraphs are usually about half as long when I ask ChatGPT to simplify.

That said, I tend to write in an academic style.

I agree that academic language should be avoided in both forums and research papers.

It might be a good idea for forum writers to use a tool like ChatGPT to make their posts more readable before posting them. For example, they can ask ChatGPT to “improve the readability” of their text. This way, writers don’t have to change their writing style too much and can avoid feeling uncomfortable while writing. Plus, it saves time by not having to go back and edit clunky sentences. Additionally, by asking ChatGPT to include more slang or colloquial language, the tool can better match the writer’s preferred style. (Written with the aid of ChatGPT in exactly the way I proposed. :p)

If I understand you correctly, what you’re proposing is essentially a subset of classical decision theory with bounded utility functions. Recall that, under classical decision theory, we choose the action a that maximizes E[U(a, S)], where S is a random state of nature and a ranges over an action space A.

Suppose there are n (infinitely many works too) moral theories T1, …, Tn, each with probability pi and associated utility Ui. Then we can define U(a, s) = p1U1(a, s) + ⋯ + pnUn(a, s). This step gives us (moral) uncertainty in our utility function.

Then, as far as I understand you, you want to define the component utility functions as indicators: Ui(a, s) = 1 if the outcome is acceptable under Ti, and 0 otherwise. As Ui only takes the values 0 and 1, E[Ui(a, S)] is the probability of an acceptable outcome under Ti. And since we’re taking the expected value of these bounded component utilities to construct U, we’re in classical bounded utility function land.
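A toy numerical sketch of this construction (the theories, credences, and acceptability rules are all hypothetical): the mixture utility U is a credence-weighted sum of indicators, so E[U] is a weighted probability of acceptability.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical credences p_i over three moral theories T_1, T_2, T_3.
p = np.array([0.5, 0.3, 0.2])

def expected_utility(a, n=50_000):
    """Monte Carlo estimate of E[U(a, S)] = sum_i p_i * P(acceptable under T_i)."""
    s = rng.random(n)  # S ~ Uniform(0, 1), a stand-in state of nature
    # Indicator component utilities U_i(a, s): hypothetical acceptability rules.
    acceptable = np.stack([s > 0.2, s > 0.5, np.full(n, a == "safe")])
    return float(p @ acceptable.mean(axis=1))

print(expected_utility("safe"), expected_utility("risky"))
```

Each component utility is bounded in [0, 1], so the mixture is too, which is what puts this inside classical bounded-utility decision theory.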

That said, I believe that

This post would benefit from a rewrite of the paragraph starting with “Success maximization is a mechanism by which to generalize maxipok”. It states “Let a be an action from the set of actions A.” Is a an action, is s an action, or both? I also don’t understand what s is. Are there states of nature in this framework? You say that t is a moral theory, so it cannot be a state of nature?

You should add concrete examples. If you add one or two, it might become easier to understand what you’re doing even if the formal definition is not 100% clear.

Thanks for writing this.

I wrote about “decay of predictions” here. I would classify the problem as hard.

Do you have a feeling for how suitable these are as academic projects, such as bachelor’s or master’s theses? It would be great to show a list of projects to students!

Could you elaborate?

Sorry, but I don’t understand what you mean.

Here’s the context I’m thinking about. Say you have two options, a and b. They have different true expected values. The market produces estimates of these expectations, and you (or the decider) choose the option with the highest estimated expectation. (I was unclear about estimation vs. true values in my previous comment.)

Does this have something to do with your remarks here?

Also, there’s always a way to implement “the market decides”. Instead of asking P(Emissions|treaty), ask P(Emissions|market advises treaty), and make the market advice = the closing prices. This obviously won’t be very helpful if no-one is likely to listen to the market, but again the point is to think about markets that people are likely to listen to.

Potential outcomes are very clearly and rigorously defined as collections of separate random variables, there is no “I know it when I see it” involved. In this case you choose between two options, and there is no conditional probability involved unless you actually need it for estimation purposes.

Let’s put it a different way. You can flip one of two coins, either a blue coin or a red coin. You estimate each coin’s probability of heads, and base your choice of which coin to toss on which estimated probability is the largest. There is actually no need to use scary-sounding terms like counterfactuals or potential outcomes at all; you’re just choosing between random outcomes.
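The coin story can be simulated directly (made-up probabilities and flip counts): estimate each coin’s heads probability from observed flips and pick the larger estimate. No counterfactual machinery required.

```python
import numpy as np

rng = np.random.default_rng(42)

# Hypothetical true heads probabilities for the blue and red coins.
p_true = {"blue": 0.55, "red": 0.45}

# Estimate each probability from 1,000 observed flips of that coin.
estimates = {coin: rng.binomial(1000, p) / 1000 for coin, p in p_true.items()}

# Choose the coin with the highest estimated heads probability.
choice = max(estimates, key=estimates.get)
print(estimates, choice)
```

The decision rule only looks at the two estimates; nothing in it requires conditioning on the choice itself.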

We could create a separate market on how the decision market resolves, and it will resolve unambiguously.

That sounds like an unnecessarily convoluted solution to a question we do not need to solve!

However we deal with that, I expect the story ends up sounding quite similar to my original comment—the critical step is that the choice does not depend on anything but the closing price.

Yes, I agree. And that’s why I believe we shouldn’t use conditional probabilities at all, as they make confusion possible.

In this case it would be best to use the language of counterfactuals (aka potential outcomes) instead of conditional expectations. In practice, the market would estimate E[Y(a)] and E[Y(b)] for the two potential outcomes Y(a) and Y(b), and you would choose the option with the highest estimated expected value. There is no need to put conditional probability into the mix at all, and it’s probably best not to, as there is no obvious probability to assign to the “events” of choosing a and choosing b.

Satan cuts an apple into a countable infinity of slices and offers it to Eve, one piece at a time. Each slice has positive utility for Eve. If Eve eats only finitely many pieces, there is no difficulty; she simply enjoys her snack. If she eats infinitely many pieces, however, she is banished from Paradise. To keep things simple, we may assume that the pieces are numbered: in each time interval, the choice is Take piece n or Don’t take piece n. Furthermore, Eve can reject piece n, but take later pieces. Taking any countably infinite set leads to the bad outcome (banishment). Finally, regardless of whether or not she is banished, Eve gets to keep (and eat) her pieces of apple. Call this the original version of Satan’s apple.

We shall sometimes discuss a simplified version of Satan’s apple, different from the original version in two respects. First, Eve is banished only if she takes all the pieces. Second, once Eve refuses a piece, she cannot take any more pieces. These restrictions make Satan’s apple a close analogue to the two earlier puzzles.

Problem: When should Eve stop taking pieces?
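The paradoxical structure is easy to see in a toy payoff calculation (the per-slice utility and banishment penalty are hypothetical): every finite stopping point is dominated by stopping one slice later, yet never stopping is the worst plan of all.

```python
BANISHMENT = -10**6  # hypothetical, very large disutility of being banished

def utility(n_slices, banished=False):
    # Each slice is worth 1 util (hypothetical); Eve keeps her slices regardless.
    return n_slices + (BANISHMENT if banished else 0)

# For every finite stopping point, taking one more slice is strictly better...
assert all(utility(n + 1) > utility(n) for n in range(10_000))
# ...but never refusing means taking infinitely many slices, and the
# banishment penalty swamps any finite pile of slices:
print(utility(100), utility(100, banished=True))
```

So there is no optimal finite stopping rule, which is exactly what makes the puzzle bite.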

I think the Stack Exchange sites have automatic reminders, or maybe even checks, for this sort of thing. My last post on Cross Validated (the Stack Exchange site for statistics) had hints about reproducible examples, I think.

Gwern has a writing checklist. Similar checklists could be forced on the author prior to submission.

Thanks for your suggestions! Big fan of yours for many years, by the way. Mating Intelligence was the article collection that made me want to become an evolutionary psychologist (I ended up a statistician though, mostly due to the much safer career path).

Now I noticed that I didn’t write in the post that these four points are just a summary; the meat of the post is in the link. I *think* I have explained these terms in the linked post, at least graded pairwise comparisons and discrete choice models. But yeah… I will modify the summary to use less technical jargon and provide an introduction.

> I think it’s important to build more connections between EA approaches to value (e.g. in AI alignment) and existing behavioral sciences methods for studying values.

Yes, and also to academia in general. I honestly didn’t think about AI alignment when writing this post, but that could be one of the applications.

# A peek at pairwise preference estimation in economics, marketing, and statistics

Thomas Hurka’s St Petersburg Paradox: Suppose you are offered a deal—you can press a button that has a 51% chance of creating a new world and doubling the total amount of utility, but a 49% chance of destroying the world and all utility in existence. If you want to maximise total expected utility, you ought to press the button—pressing the button has positive expected value. But the problem comes when you are asked whether you want to press the button again and again and again—at each point, the person trying to maximise expected utility ought to agree to press the button, but of course, eventually they will destroy everything.[2]

I have two gripes with this thought experiment. First, time is not modelled. Second, it’s left implicit why we should feel uneasy about the thought experiment, and that doesn’t work when philosophical intuitions are so variable. I honestly don’t feel uneasy about the thought experiment at all (only slightly annoyed). But maybe I would have, had it been completely specified.

I can see two ways to add a time dimension to the problem. First, you could let all the presses be predetermined and happen in one go, which gets us into Satan’s apple territory. Second, you could have a 30-second pause between presses. But in that case, we would accumulate massive amounts of utility in a very short time; just the seconds in between presses would be invaluable! And who cares if the world ends in five minutes with probability approaching one when every second it survives is so sweet? :p
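To make the “eventually they will destroy everything” point concrete: the expected multiplier per press is 0.51 × 2 = 1.02 > 1, so each press looks good in isolation, yet the probability of surviving n presses is 0.51^n, which decays to zero fast. A quick check:

```python
# Each press: 51% chance total utility doubles, 49% chance everything is
# destroyed. Expected multiplier per press is 0.51 * 2 = 1.02 > 1, but the
# probability of the world surviving n consecutive presses is 0.51 ** n.
for n in (1, 10, 100):
    print(n, 0.51 ** n)
```

Already at ten presses the survival probability is below one percent, which is the tension the thought experiment trades on.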

I don’t understand the relevance of the Kelly criterion. The Wikipedia page for the Kelly criterion states that “[t]he Kelly bet size is found by maximizing the expected value of the logarithm of wealth,” but that’s not relevant here, is it?

I’m not sure what you mean. I’m thinking about pairwise comparisons in the following way.

(a) Every pair of items i, j has a true ratio of expectations E(Xi)/E(Xj) = μij. I hope this is uncontroversial.

(b) We observe the variables Rij according to log Rij ∼ log μij + ϵij for some normally distributed ϵij. Error terms might be dependent, but that complicates the analysis (and is most likely not worth it). This step could be more controversial, as there are other possible models to use.

Note that you will also get a distribution over every E(Xi) with this approach, but that would be in the Bayesian sense, i.e., p(E(Xi) ∣ comparisons), when we have a prior over the E(Xi).
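Here is a sketch of fitting model (b) by least squares on the log scale, with simulated data (the true means and noise level are made up, and priors are omitted, so this is the frequentist point estimate rather than the Bayesian version):

```python
import numpy as np

rng = np.random.default_rng(1)

# Simulated ground truth: E(X_1), E(X_2), E(X_3); mu_ij = E(X_i) / E(X_j).
mu = np.array([1.0, 2.0, 4.0])

# Observe noisy ratios R_ij: log R_ij = log mu_ij + eps_ij, eps ~ N(0, 0.1^2).
pairs = [(i, j) for i in range(3) for j in range(3) if i != j]
log_r = np.array([np.log(mu[i] / mu[j]) + rng.normal(0, 0.1) for i, j in pairs])

# Design matrix for log mu_ij = log E(X_i) - log E(X_j). The log-means are
# identified only up to a common constant, so fix log E(X_1) = 0 by
# dropping its column before solving.
X = np.zeros((len(pairs), 3))
for row, (i, j) in enumerate(pairs):
    X[row, i], X[row, j] = 1.0, -1.0
est, *_ = np.linalg.lstsq(X[:, 1:], log_r, rcond=None)
print(np.exp(est))  # estimated ratios E(X_2)/E(X_1), E(X_3)/E(X_1)
```

Putting a prior on the log-means and swapping the least-squares step for a posterior would give the Bayesian distribution over the E(Xi) mentioned above.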