The case for content authenticity in an age of disinformation, deepfakes and NFTs
Did you hear about the award-winning documentary photographer from an esteemed photo agency whose faked images of North Macedonia’s industrial cityscapes tricked even the skilled eyes of the experts at a recent French photojournalism festival?
Despite the random computer-generated bears that Jonas Bendiksen added to his pictures, nobody noticed his photos had been intentionally altered with computer software. That is, until Bendiksen himself exposed the fraud from a Twitter account he had created under a pseudonym.
His point was clear: If fake images can dupe the pros, imagine how hard it is for the rest of us to know what’s authentic. He wanted to see how far he could get “before the guards woke up. They never woke up.”
A wake-up call
Our inability to distinguish fantasy from reality in digital images is a wake-up call. In an increasingly fragmented media landscape, we are witnessing extraordinary challenges to trust in media. Now, the same powerful, easy-to-use tools that make and share legitimate content are also deployed to create and spread disinformation and misinformation.
This isn’t a problem that’s going away. There could be 100 times more visual content by 2027, according to one study. One expert quoted in Nina Schick’s book Deepfakes: The Coming Infocalypse estimates that synthetic video may account for as much as 90% of online video in just three to five years – meaning it will be generated partially or entirely by artificial intelligence (AI), not humans. Already, deepfake videos like those on the deeptomcruise TikTok account, created with the help of AI and machine-learning algorithms, are incredibly convincing (hint: it’s not actually Tom Cruise).
A study this year in the journal Nature about the rise of misinformation online found that people are more focused on sharing what they think will boost their social status than on sharing what is true. Governments and bad actors already know this, and they use it to spread misinformation (unintentional deception) or sow disinformation (deceit with intent).
Giving credit where it’s due
Getting at the truth isn’t the only thing at stake. Understanding the authenticity of content is also a big deal for the people who make it – the creators, the creative professionals and the communicators.
Content creators often go uncredited and unpaid when their images are repurposed into an endless variety of re-edited viral memes that stray from their original intent or purpose. In the red-hot market for non-fungible tokens (NFTs), protecting the creators of these original pieces of “cryptoart” should be just as important as protecting the collectors who buy them.
From photos to deepfake videos to NFTs and other digital file types, true transparency into how a piece of digital content is created and changed is critical to determining whether we can trust the source. What’s real, what’s fake, what’s valuable, who created it and with which editing tools – as well as what changes were made to the original – are the questions we should all be asking.
How to know what is authentic
These questions arise as many of us wrestle with new terms to help us understand what is real and what is fake. Deepfakes, cheap/shallowfakes, NFTs, blockchains, synthetic media, content authenticity and provenance – knowing these buzzwords in the context of our increasingly digital lives is imperative. (Need help? Scroll for our content authenticity primer.)
What these words mean for content creators and society is the foundation of our efforts leading the Content Authenticity Initiative (CAI). Building on Adobe’s heritage of providing trustworthy and innovative digital solutions for our customers, the CAI is a collaborative community of leading media and technology companies as well as individual creators, technologists, journalists, activists, and leaders addressing misinformation and content authenticity at scale. What started in late 2019 as a venture between Adobe, Twitter and The New York Times has grown over the past two years to more than 375 members – including AFP, Arm, the BBC, Getty Images, Microsoft, Nikon, Qualcomm, Truepic, The Washington Post, and many others.
As a former war photographer and media executive who has dedicated my career to getting at the truth, often at great personal risk, I strongly believe that having a shared understanding of objective facts can help us all make more thoughtful decisions when consuming media.
The CAI and C2PA
With the CAI, we’re busy raising awareness, advocating, and creating educational programs. In the Coalition for Content Provenance and Authenticity (C2PA), we are among the many contributors creating an open standard: a blueprint for a new technical specification (currently open for public feedback) for digital content “provenance.”
This is a digital chain of custody, like a manifest, that shows where content came from, who made it, and the journey it has been on since it was published, including edits and other manipulations. The new C2PA opt-in, open-source attribution standard is a powerful example of how a diverse group of companies can come together to collaborate on what it means to take responsibility for the entire media ecosystem.
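To make this concrete, here is a minimal, hypothetical Python sketch of the general hash-chaining technique behind a tamper-evident edit history. It illustrates the idea only – the field names are invented for this example, and this is not the actual C2PA manifest format:

```python
import hashlib
import json
from datetime import datetime, timezone

def fingerprint(data: bytes) -> str:
    """Return the SHA-256 hex digest of the given bytes."""
    return hashlib.sha256(data).hexdigest()

def add_entry(history, actor, action, asset_bytes):
    """Append a provenance entry that binds the asset's current state
    to everything recorded before it."""
    entry = {
        "actor": actor,                           # who made the change
        "action": action,                         # what was done
        "timestamp": datetime.now(timezone.utc).isoformat(),
        "asset_hash": fingerprint(asset_bytes),   # state of the file
        "prev_hash": history[-1]["entry_hash"] if history else "genesis",
    }
    # Hash the entry itself so any later tampering is detectable.
    entry["entry_hash"] = fingerprint(json.dumps(entry, sort_keys=True).encode())
    history.append(entry)
    return history

def verify(history):
    """Recompute every link; editing any past entry breaks the chain."""
    prev_hash = "genesis"
    for entry in history:
        body = {k: v for k, v in entry.items() if k != "entry_hash"}
        if entry["prev_hash"] != prev_hash:
            return False
        if fingerprint(json.dumps(body, sort_keys=True).encode()) != entry["entry_hash"]:
            return False
        prev_hash = entry["entry_hash"]
    return True

history = add_entry([], "photographer", "captured", b"raw image bytes")
history = add_entry(history, "editor", "cropped", b"cropped image bytes")
print(verify(history))                 # True
history[0]["actor"] = "someone else"   # tamper with the record...
print(verify(history))                 # False: the chain no longer checks out
```

Because each entry’s hash folds in the hash of the entry before it, altering any historical record invalidates every link that follows – the same property that makes blockchains tamper-evident (see the glossary below).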
It’s going to take time and everyone’s participation to make true content authenticity a reality. I’m confident that together, we can begin to restore the collective trust and understanding that right now seems so challenging.
The C2PA is only one of the collaborative efforts the CAI has worked on since its inception just two years ago. We will have more updates to share during Adobe MAX 2021 (free and all virtual from Oct. 26-28) as we continue to help solve fundamental challenges of content authenticity.
It’s going to take true industry collaboration to fight misinformation with attribution and verifiable facts about content, but with the CAI, we’re committed to ensuring that creators get credit for their digital work and that consumers have a way to verify whether digital content has been altered, and to what degree.
To stay up to date with the CAI and our open, extensible approach for providing media transparency to allow for better evaluation of content, sign up to become a CAI member organization.
The vocabulary of Content Authenticity
With the volume of digital content increasing, people want to know that what they see online is authentic. Currently there’s a lack of transparency: people can be misled when they don’t know who has altered content or what has been changed. Content authenticity means proper content attribution for creators and publishers, which helps ensure trust and transparency online.
Other important concepts and definitions around content authenticity:
- Attribution
- The action of crediting work, such as art or designs, to a particular person.
- Blockchain
- A specific type of database that stores data in blocks and chains them together. Blockchain is considered highly secure because each block is given an exact timestamp when it is added to the chain, and a new block can’t be added until the legitimacy of the previous blocks has been verified. Blocks can be housed on hundreds or thousands of computers rather than under one roof, which is why a blockchain is often called a distributed digital ledger. (The hash-chaining sketch earlier in this article illustrates the core mechanism.)
- Content credentials
- Digital keys or certificates that can be used to verify that content is authentic and has not been corrupted or altered. (A minimal sign-and-verify sketch follows this glossary.)
- Cheapfake
- Photos or videos that may or may not have been altered but that are misrepresented with intent to deceive. Unlike deepfakes, cheapfakes can be made with readily available technology, such as face-swapping apps. Another term for this is shallowfake.
- Deepfake
- A video or photo that has been digitally manipulated so that a person’s likeness is altered or replaced with someone else’s, often using sophisticated AI technology or machine learning.
- Digital chain of custody
- The chronological record of a piece of digital media that provides tamper-evident documentation of who produced and shared the content, and when and how, from creation to viewing.
- Metadata
- Information about data that summarizes underlying details about the data itself – think of it as a reference table describing the data. A photo’s capture date and camera model are common examples.
- Disinformation
- False information that the person spreading it knows to be false. Disinformation is spread deliberately to deceive.
- Misinformation
- False, inaccurate, or misleading information shared by someone who believes it to be true. People who share misinformation aren’t necessarily trying to deceive; they are simply sharing things that aren’t true or accurate. An example could be sharing a photo of hurricane damage in Texas taken 10 years ago in the context of a hurricane last year in Florida.
- NFT (non-fungible token)
- A digital asset that represents a real-world object such as art, music, or video. An NFT can be bought and sold online, usually with cryptocurrency, and is encoded with the same underlying software (blockchain) technology as the cryptocurrency. NFTs have unique identifying codes, which indelibly tie each token to its owner. However, there is currently no way to tie a creator’s authenticity to the NFT.
- Digital provenance
- A tamper-evident record of a piece of digital media, showing both the custody of the file (from creator and editor through producer and export) and the series of edits or actions taken on it.
- Open industry standard
- A phrase often used in technology to describe something that’s developed (or approved) and maintained via a collaborative, consensus-driven process and intended for the general public. PDF is an example of an open industry standard: although originally developed by Adobe, the PDF file specification was released in 2008 as an open standard to the International Organization for Standardization (ISO), which manages future development of the file format via a committee of volunteer industry experts.
- Synthetic media
- A catch-all term for the artificial production, manipulation, and modification of data and media by automated means, especially artificial intelligence algorithms, often for the purpose of misleading people or changing an original meaning. Synthetic media as a field has grown rapidly since the creation of generative adversarial networks (GANs), primarily through the rise of deepfakes as well as music synthesis, text generation, human image synthesis, speech synthesis, and more.
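To illustrate the content credentials entry above, here is a small hypothetical sketch of the core sign-and-verify step, using the third-party Python cryptography package. Real credential systems layer identity certificates and standardized manifests on top of this; the sketch shows only the underlying principle:

```python
# Sketch only: a creator signs a file's bytes with a private key, and
# anyone holding the matching public key can check that the bytes are
# unchanged. Requires: pip install cryptography
from cryptography.exceptions import InvalidSignature
from cryptography.hazmat.primitives.asymmetric.ed25519 import Ed25519PrivateKey

creator_key = Ed25519PrivateKey.generate()   # creator's signing key
public_key = creator_key.public_key()        # shared with verifiers

content = b"original image bytes"
signature = creator_key.sign(content)        # the "credential"

def is_authentic(data: bytes, sig: bytes) -> bool:
    """Return True only if the data matches the creator's signature."""
    try:
        public_key.verify(sig, data)
        return True
    except InvalidSignature:
        return False

print(is_authentic(content, signature))                 # True
print(is_authentic(b"altered image bytes", signature))  # False
```

If even one byte of the file changes, verification fails – which is exactly the property a viewer needs in order to trust that content hasn’t been silently altered.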