I think the compute/network cost of hashing (reading through literally all content) is large, possibly multiple orders of magnitude more than the cost implied here.
Yeah, they say[1] they have over 100 PB of content. That is quite a bit, and if it’s not in an in-house datacenter, going through it will be expensive.
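For scale, here is a rough back-of-envelope sketch. The 100 PB figure is from their post; the throughput and egress prices are assumptions I’ve picked purely for illustration:

```python
# Back-of-envelope: what does hashing ~100 PB cost?
# All rate/price numbers below are assumed, not measured.
PETABYTE = 10**15

total_bytes = 100 * PETABYTE        # ~100 PB of archived content
hash_throughput = 10**9             # assume ~1 GB/s of SHA-256 per core

core_hours = total_bytes / hash_throughput / 3600
print(f"{core_hours:,.0f} core-hours of hashing")   # ~27,778 core-hours

# If the data sits in commodity cloud storage, egress can dominate compute.
egress_cost_per_gb = 0.05           # assumed $/GB
egress_cost = total_bytes / 10**9 * egress_cost_per_gb
print(f"${egress_cost:,.0f} in egress alone")       # ~$5,000,000
```

Under these assumptions the compute itself is modest, but moving the bytes out of third-party storage is not, which is why “in-house datacenter or not” matters so much here.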
If 20% of content remains untimestamped, do issues about the credibility of content remain?
If 20% of content remains untimestamped, then one wouldn’t consider all non-timestamped content suspicious on that account alone. The benefits show up in other ways:
If 80% of content is timestamped, then all that content is protected from suspicion that newer AI might have created it.
If the Internet Archive is known to have timestamped all of its content, then non-timestamped content purportedly from an old enough version of a website that is in the archive becomes suspicious.
One might still consider non-timestamped content suspicious in a future where AI and/or institutional decline has begun eroding the prior (default, average, general) trust in all content.
There’s probably content with many tiny variations, and it would be better to group that content together? … Finding the algorithm/implementation to do this seems important, but also orders of magnitude more costly?
It might be important, but it’s probably not as urgent. Timestamping has to happen at the time you want the timestamp for. Investigating, and convincing people of, which pieces of content are equivalent from some inexact (or exact but higher-level) point of view can be done later. I imagine this is one possible future application for which the timestamps will be valuable. Applications such as these I would probably put out of scope, though.
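To illustrate why this can be deferred: any “exact but higher-level” equivalence can be computed later from the raw content, as long as the raw content was timestamped. A minimal sketch, where the specific normalization (lowercasing and collapsing whitespace) is a made-up example rather than a recommendation:

```python
import hashlib

def normalized_fingerprint(content: bytes) -> str:
    """Hash content after collapsing case and whitespace, so trivially
    differing variants share a fingerprint. The normalization chosen here
    is illustrative only; real grouping would need a more careful scheme."""
    text = content.decode("utf-8", errors="replace")
    canonical = " ".join(text.lower().split())
    return hashlib.sha256(canonical.encode()).hexdigest()

variants = [b"Hello,  World!", b"hello, world!", b"Goodbye"]
groups: dict[str, list[bytes]] = {}
for v in variants:
    groups.setdefault(normalized_fingerprint(v), []).append(v)
# The first two variants fall into one group, the third into its own.
```

Nothing in this grouping step needs to exist at timestamping time; it only consumes the content that the timestamps already protect.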
Let me clarify the cryptography involved:
There is cryptographic signing, that lets Alice sign a statement X so that Bob is able to cryptographically verify that Alice claims X. X could for example be “Content Y was created in 2023”. This signature is evidence for X only to the extent that Bob trusts Alice. This is NOT what I suggest we use, at least not primarily.
There is cryptographic time-stamping, that lets Alice timestamp content X at time T so that Bob is able to cryptographically verify that content X existed before time T. Bob does not need to trust Alice, or anyone else at all, for this to work. This is what I suggest we use.
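The standard way to make this cheap at scale is to batch many documents into a Merkle tree and publish only the root somewhere widely witnessed (this is, roughly, what systems like OpenTimestamps do). A minimal sketch:

```python
import hashlib

def h(data: bytes) -> bytes:
    return hashlib.sha256(data).digest()

def merkle_root(leaves: list[bytes]) -> bytes:
    """Fold a list of documents into a single 32-byte commitment."""
    level = [h(leaf) for leaf in leaves]
    while len(level) > 1:
        if len(level) % 2:              # duplicate last node on odd levels
            level.append(level[-1])
        level = [h(level[i] + level[i + 1]) for i in range(0, len(level), 2)]
    return level[0]

docs = [b"content A", b"content B", b"content C"]
root = merkle_root(docs)
# Publishing `root` at time T (e.g. embedding it in a public blockchain)
# commits to every document at once. Later, an inclusion path from
# h(doc) up to `root` proves that document existed before T, without
# trusting whoever built the tree.
```

This is why Bob doesn’t need to trust Alice: the proof rests on the hash function and on the public record of when the root appeared, not on anyone’s honesty.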
Back-dating content is therefore cryptographically impossible when using cryptographic time-stamping. That is kind of the point; otherwise I wouldn’t be convinced that the value of the timestamps would grow over time. To the extent we use cryptographic time-stamping, the argument is ‘back-dating will be entirely impossible in the future’.
However, cryptographic time-stamping and cryptographic signing can be combined in interesting ways:
We could sign first and then timestamp, achieving a cryptographic proof that in or before 2023, archive.org claimed that content X was created in 1987. This might be valuable if the organization or its cryptographic key were later compromised, e.g. by corruption, hacking, or government overreach. Timestamps created before an organization is compromised can still be trusted: you can always know the content was created in or before 2023, even if you have reason to doubt a claim made at that time.
We could timestamp, then sign, then timestamp. This allows anyone to cryptographically verify that e.g. sometime between 2023-01-20 and 2023-01-30, Alice claimed that content X was created in 1987. This could be valuable if we later learn we have reason to distrust the organization before a certain date. Again, we will always know X was created before 2023-01-30, no matter anyone’s trustworthiness.
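The two combinations above can be sketched as data flow. The primitives here are stand-ins I’ve chosen for illustration: HMAC stands in for a real public-key signature, and `timestamp()` stands in for committing a digest to a widely witnessed log at the stated time:

```python
import hashlib, hmac, json

KEY = b"archive-signing-key"   # stand-in; a real scheme would use a keypair

def sign(data: bytes) -> bytes:
    """HMAC as a placeholder for a public-key signature."""
    return hmac.new(KEY, data, hashlib.sha256).digest()

def timestamp(data: bytes, when: str) -> dict:
    """Placeholder for anchoring a digest in a public log at time `when`."""
    return {"digest": hashlib.sha256(data).hexdigest(), "time": when}

content = b"content X"
claim = content + b"|claimed-origin:1987"

# Sign, then timestamp: proves the signed claim existed by 2023-01-30.
sig = sign(claim)
proof_a = timestamp(claim + sig, "2023-01-30")

# Timestamp, sign, timestamp: brackets *when* the claim was made.
ts1 = timestamp(content, "2023-01-20")
sig2 = sign(json.dumps(ts1).encode())
proof_b = timestamp(json.dumps(ts1).encode() + sig2, "2023-01-30")
# proof_b shows the signed claim was made between 2023-01-20 and
# 2023-01-30; proof_a and proof_b both survive later key compromise.
```

In both cases the outer timestamp is what remains trustworthy unconditionally; the signature only adds a revocable layer of “who said so, and when”.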
As for the issue of 2023 timestamps being misleading for 1995 content: this issue is probably very real, but it’s less urgent. Making the timestamps is urgent. On top of the underlying data and cryptographic proofs, different UIs can be built and improved over time.