In a research assistant setting, you could imagine the top-level task being something like “Was this a double-blind study?”, which we might factor into sub-questions like:
- Were the participants blinded?
  - Was there a placebo?
    - Which paragraphs relate to placebos?
      - Does this paragraph state there was a placebo?
      - …
  - Did the participants know if they were in the placebo group?
  - …
- Were the researchers blinded?
- …
In this example, by the time we get to the “Does this paragraph state there was a placebo?” level, a submodel faces a fairly tractable question-answering task over a single paragraph. A typical response might be a confidence level plus text spans pointing to the most relevant phrases.
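To make the shape of that concrete, here is a minimal sketch in Python. It is purely illustrative, not Elicit’s actual implementation: the Question/Answer types and the answer_leaf stub are invented for this example, and a real system would call a language model (and do retrieval for the “which paragraphs relate to placebos?” step) where the stub below just keyword-matches.

```python
from dataclasses import dataclass, field

@dataclass
class Answer:
    confidence: float   # e.g. 0.9 = "very likely yes"
    spans: list[str]    # phrases supporting the answer

@dataclass
class Question:
    text: str
    subquestions: list["Question"] = field(default_factory=list)

def answer_leaf(question: Question, paragraph: str) -> Answer:
    """Stand-in for a submodel answering a tractable QA task over one
    paragraph. A real system would call a language model here; this
    toy version just checks for a keyword."""
    if "placebo" in paragraph.lower():
        return Answer(confidence=0.9, spans=["placebo-controlled"])
    return Answer(confidence=0.1, spans=[])

def answer(question: Question, paragraph: str) -> Answer:
    """Recurse through the decomposition: leaves go to the submodel,
    and internal nodes aggregate their children (here, naively, by
    taking the minimum confidence and pooling the spans)."""
    if not question.subquestions:
        return answer_leaf(question, paragraph)
    child = [answer(q, paragraph) for q in question.subquestions]
    return Answer(
        confidence=min(a.confidence for a in child),
        spans=[s for a in child for s in a.spans],
    )

# Part of the decomposition from above.
tree = Question("Was this a double-blind study?", [
    Question("Were the participants blinded?", [
        Question("Was there a placebo?", [
            Question("Does this paragraph state there was a placebo?"),
        ]),
    ]),
    Question("Were the researchers blinded?"),
])

print(answer(tree, "Patients received a placebo-controlled treatment ..."))
```

The point is just the shape: the leaves are tractable single-paragraph QA tasks, and everything above them only aggregates sub-answers.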
Thank you, this was super informative! My understanding of Ought just improved a lot.
Once you’re able to answer questions like that, what do you build next?
Is “Was this a double-blind study?” an actual question that your users/customers are very interested in?
If not, could you give me some other example that is?
You’re welcome!
The goal is for Elicit to be a research assistant that leads to more, and higher-quality, research. Literature review is only one small part of that: we would like to add functionality like brainstorming research directions, finding critiques, identifying potential collaborators, …
Beyond that, we believe that factored cognition could scale to lots of knowledge work. Anywhere the tasks are fuzzy, open-ended, or have long feedback loops, we think Elicit (or our next product) could be a fit: journalism, think tanks, policy work.
It is, very much. Answering so-called strength-of-evidence questions accounts for a big chunk of researchers’ time today.