New bipartisan bill would require labeling of AI-generated videos and audio

bnew

Veteran
Joined
Nov 1, 2015
Messages
43,344
Reputation
7,312
Daps
132,306

SocialMedia-1024x683.jpg


New bipartisan bill would require labeling of AI-generated videos and audio​

Politics Mar 21, 2024 5:32 PM EDT

WASHINGTON (AP) — Bipartisan legislation introduced in the House Thursday would require the identification and labeling of online images, videos and audio generated using artificial intelligence, the latest effort to rein in rapidly developing technologies that, if misused, could easily deceive and mislead.

So-called deepfakes created by artificial intelligence can be hard or even impossible to tell from the real thing. AI has already been used to mimic President Joe Biden’s voice, exploit the likenesses of celebrities and impersonate world leaders, prompting fears it could lead to greater misinformation, sexual exploitation, consumer scams and a widespread loss of trust.

Key provisions in the legislation would require AI developers to identify content created using their products with digital watermarks or metadata, similar to how photo metadata records the location, time and settings of a picture. Online platforms like TikTok, YouTube or Facebook would then be required to label the content in a way that would notify users. Final details of the proposed rules would be crafted by the National Institute of Standards and Technology, a small agency within the U.S. Department of Commerce.

Violators of the proposed rule would be subject to civil lawsuits.

WATCH: Why more people are turning to artificial intelligence for companionship

“We’ve seen so many examples already, whether it’s voice manipulation or a video deepfake. I think the American people deserve to know whether something is a deepfake or not,” said Rep. Anna Eshoo, a Democrat who represents part of California’s Silicon Valley. Eshoo co-sponsored the bill with Republican Rep. Neal Dunn of Florida. “To me, the whole issue of deepfakes stands out like a sore thumb. It needs to be addressed, and in my view the sooner we do it the better.”

If passed, the bill would complement voluntary commitments by tech companies as well as an executive order on AI signed by Biden last fall that directed NIST and other federal agencies to set guidelines for AI products. That order also required AI developers to submit information about their product’s risks.

Eshoo’s bill is one of a few proposals put forward to address concerns about the risks posed by AI, worries shared by members of both parties. Many say they support regulation that would protect citizens while also ensuring that a rapidly growing field can continue to develop in ways that benefit a long list of industries like health care and education.

The bill will now be considered by lawmakers, who likely won’t be able to pass any meaningful rules for AI in time for them to take effect before the 2024 election.

“The rise of innovation in the world of artificial intelligence is exciting; however, it has potential to do some major harm if left in the wrong hands,” Dunn said in a statement announcing the legislation. Requiring the identification of deepfakes, he said, is a “simple safeguard” that would benefit consumers, children and national security.

Several organizations that have advocated for greater safeguards on AI said the bill introduced Thursday represented progress. So did some AI developers, like Margaret Mitchell, chief AI ethics scientist at Hugging Face, which has created a ChatGPT rival called Bloom. Mitchell said the bill’s focus on embedding identifiers in AI content — known as watermarking — will “help the public gain control over the role of generated content in our society.”

“We are entering a world where it is becoming unclear which content is created by AI systems, and impossible to know where different AI-generated content came from,” she said.
 

bnew

Veteran
Joined
Nov 1, 2015
Messages
43,344
Reputation
7,312
Daps
132,306
Key provisions in the legislation would require AI developers to identify content created using their products with digital watermarks or metadata, similar to how photo metadata records the location, time and settings of a picture. Online platforms like TikTok, YouTube or Facebook would then be required to label the content in a way that would notify users. Final details of the proposed rules would be crafted by the National Institute of Standards and Technology, a small agency within the U.S. Department of Commerce.


how would this work with open source software especially when the developers aren't located in the united states?
2JqMm1y.png
 

bnew

Veteran
Joined
Nov 1, 2015
Messages
43,344
Reputation
7,312
Daps
132,306

A new bill wants to reveal what’s really inside AI training data​


Rep. Adam Schiff’s bill garnered support from several entertainment industry groups.​

By Emilia David, a reporter who covers AI. Prior to joining The Verge, she covered the intersection between technology, finance, and the economy.

Apr 10, 2024, 1:35 PM EDT

Photo illustration of the shape of a brain on a circuitboard.

Illustration: Cath Virginia / The Verge | Photos: Getty Images

A new bill would compel tech companies to disclose any copyrighted materials that are used to train their AI models.

The Generative AI Copyright Disclosure bill from Rep. Adam Schiff (D-CA) would require anyone making a training dataset for AI to submit reports on its contents to the Copyrights Register. The reports should include a detailed summary of the copyrighted material in the dataset and the URL for the dataset if it’s publicly available. This requirement will be extended to any changes made to the dataset.

Companies must submit a report “not later than 30 days” before the AI model that used the training dataset is released to the public. The bill will not be retroactive to existing AI platforms unless changes are made to their training datasets after it becomes law.

Schiff’s bill hits on an issue artists, authors, and other creators have been complaining about since the rise of generative AI: that AI models are often trained on copyrighted material without permission. Copyright and AI have always been tricky to navigate, especially as the question of how much AI models change or mimic protected content has not been settled. Artists and authors have turned to lawsuits to assert their rights.

Developers of AI models claim their models are trained on publicly available data, but the sheer amount of information means they don’t know specifically which data is copyrighted. Companies have said any copyrighted materials fall under fair use. Meanwhile, many of these companies have begun offering legal cover to some customers if they find themselves sued for copyright infringement.

Schiff’s bill garnered support from industry groups like the Writers Guild of America (WGA), the Recording Industry Association of America (RIAA), the Directors Guild of America (DGA), the Screen Actors Guild - American Federation of Television and Radio Artists (SAG-AFTRA), and the Authors Guild. Notably absent from the list of supporters is the Motion Picture Association (MPA), which normally backs moves to protect copyrighted work from piracy. (Disclosure: The Verge’s editorial staff is unionized with the Writers Guild of America, East.)

Other groups have sought to bring more transparency to training datasets. The group Fairly Trained wants to add labels to AI models if they prove they asked for permission to use copyrighted data.
 
Top