Games

Twitch is Limiting Streamers To 100 hours of Highlights and Uploads (theverge.com) 23

Twitch is planning to cull some of the content archived by streamers to save on storage costs. From a report: On Wednesday the streaming platform announced that it will introduce a 100-hour storage cap for Highlights and Uploads starting April 19th, warning that users will have their content automatically deleted until it falls below the limit.

Twitch says it's doing this because "Highlights haven't been very effective in driving discovery or engagement," and it isn't worth the cost of storing thousands of hours of such content. Twitch is owned by Amazon, a market-leading cloud storage provider -- a detail that hasn't gone unnoticed by streamers criticizing the decision.

Iphone

Apple Launches the iPhone 16E, With In-House Modem and Support For AI (theverge.com) 82

Apple has launched the iPhone 16E, featuring a 6.1-inch OLED display, Face ID, an A18 chipset, USB-C, 48MP camera, and support for Apple Intelligence. Gone but not forgotten: the home button, Touch ID and 64GB of base storage. The Verge reports: The 16E includes the customizable Action Button, but not the new Camera Control you'll find on the 16 series. It does swap its Lightning port for USB-C, now a requirement for the phone to be sold in the EU. On the inside, there's an A18 chipset, the same chip as the iPhone 16. That makes the 16E powerful enough to run Apple Intelligence, the suite of AI tools that includes notification summaries. Even the non-Pro iPhone 15 can't do that, so the 16E is one of the most capable iPhones out there. Apple has previously confirmed that 8GB RAM was the minimum to get Apple Intelligence support in the iPhone 16 series, so it's likely that the 16E also boasts at least that much memory. It's also been bumped to a baseline of 128GB of storage, meaning there's no longer a 64GB iPhone.

There's only a single 48-megapixel rear camera; the lack of additional cameras is the biggest downgrade compared to the company's other handsets. With support for wireless charging and a water-resistant IP rating, there's little you have to give up elsewhere. The iPhone 16E is also the first iPhone to include a modem developed by Apple itself. The company has spent years trying to move away from modems developed by Qualcomm, and we're finally seeing the fruits of that labor. The big questions now are how well the new modem performs and whether Apple is ready to roll out its own connectivity components in the iPhone 17 line later this year.
It's available for Friday starting at $599 with 128GB of storage.
Data Storage

Sandisk Puts Petabyte SSDs On the Roadmap (tomshardware.com) 28

SanDisk aims to produce petabyte-scale SSDs through its new UltraQLC platform, though the company has not specified a release timeline. The technology, it said, combines SanDisk's BICS 8 QLC 3D NAND with a proprietary 64-channel controller featuring hardware accelerators that offload storage functions from firmware to reduce latency and improve reliability.

The initial UltraQLC drives will use 2Tb NAND chips to reach 128TB capacities, with future iterations targeting 256TB, 512TB, and eventually 1PB as higher-density NAND becomes available. The controller dynamically adjusts power based on workload and employs an advanced bus multiplexer to handle increased data loads from high-density QLC stacks, the company said.
Data Storage

NAND Flash Prices Plunge Amid Supply Glut, Factory Output Cut (theregister.com) 34

NAND flash prices are expected to slide due to oversupply, forcing memory chipmakers to cut production to match lower-than-expected orders from PC and smartphone manufacturers. From a report: The superabundance of stock is putting a financial strain on suppliers of NAND flash, according to TrendForce, which says growth rate forecasts are being revised down from 30 percent to 10-15 percent for 2025.

"NAND flash manufacturers have adopted more decisive production cuts, scaling back full-year output to curb bit supply growth. These measures are designed to swiftly alleviate market imbalances and lay the groundwork for a price recovery," TrendForce stated.

Shrish Pant, Gartner director analyst and technology product leader, expects NAND flash pricing to remain weak for the first half of 2025, though he projects higher bit shipments for SSDs in the second half due to continuing AI server demand.

"Vendors are currently working tirelessly to discipline supply, which will lead to prices recovering in the second half of 2025. Long term, AI demand will continue to drive the demand for higher-capacity/better-performance SSDs," Pant said. Commenting on the seasonal nature of the memory market, Pant told The Register: "Buying patterns will mean that NAND flash prices will remain cyclical depending on hyperscalers' buying behavior."

Businesses

When a Lifetime Subscription Can Save You Money - and When It's Risky (msn.com) 25

Apps offering lifetime subscriptions may pose risks despite potential cost savings, according to cybersecurity experts and analysts. While some lifetime plans can pay off quickly - like dating app Bumble's $300 premium subscription that breaks even in five months - others require years of use to justify hefty upfront costs. Meditation app Waking Up charges $1,500 for lifetime access, requiring over 11 years of use to recoup the investment.

Security researchers warn against lifetime subscriptions for services with high recurring costs like VPNs and cloud storage. Such providers may compromise user privacy or cut corners on infrastructure to offset losses, said Trevor Hilligoss, senior vice president at cybercrime research group SpyCloud Labs.
AI

DeepSeek Removed from South Korea App Stores Pending Privacy Review (france24.com) 3

Today Seoul's Personal Information Protection Commission "said DeepSeek would no longer be available for download until a review of its personal data collection practices was carried out," reports AFP. A number of countries have questioned DeepSeek's storage of user data, which the firm says is collected in "secure servers located in the People's Republic of China"... This month, a slew of South Korean government ministries and police said they blocked access to DeepSeek on their computers. Italy has also launched an investigation into DeepSeek's R1 model and blocked it from processing Italian users' data. Australia has banned DeepSeek from all government devices on the advice of security agencies. US lawmakers have also proposed a bill to ban DeepSeek from being used on government devices over concerns about user data security.
More details from the Associated Press: The South Korean privacy commission, which began reviewing DeepSeek's services last month, found that the company lacked transparency about third-party data transfers and potentially collected excessive personal information, said Nam Seok [director of the South Korean commission's investigation division]... A recent analysis by Wiseapp Retail found that DeepSeek was used by about 1.2 million smartphone users in South Korea during the fourth week of January, emerging as the second-most-popular AI model behind ChatGPT.
Businesses

Amazon Tests Robots For Automating Fulfillment Centers (yahoo.com) 41

Yahoo Finance shares an interesting prediction. Amazon has an "under-the-radar robot push" that "could boost its profit margins big-time, Morgan Stanley managing director Brian Nowak said." Nowak said Amazon has quietly developed six significant next-generation fulfillment centers in the past three years that bring automation front and center... Amazon now has industrial robots that can increase efficiencies across the storage, inventory management, pick and packing, and sorting order fulfillment processes.

Fulfillment costs make up about 20% of Amazon's retail revenue, so he reasoned that automation could have a significant impact on long-term operating profit potential. Nowak says if 30% to 40% of Amazon's US units were fulfilled through next-generation robotics-enabled warehouses by 2030, it could lead to $10 billion-plus of savings... The investments in robots may already be paying off. Amazon's North America retail operating margins on a trailing 12-month basis have risen for five straight quarters. North America operating margins improved to 6.2% from 4.6% a year ago.

Nowak made the remarks on a Yahoo Finance podcast (at the top of their article) after touring one of Amazon's robot-enhanced sites in Louisiana. He believes robotics can drive down Amazon's costs compared to other retailers like Target (which he sees as lagging behind Amazon on robotics).

Meanwhile workers at an Amazon facility in North Carolina held a vote Saturday on whether to unionize. But roughly 75% of the workers voted against unionization.
Supercomputing

The IRS Is Buying an AI Supercomputer From Nvidia (theintercept.com) 150

According to The Intercept, the IRS is set to purchase an Nvidia SuperPod AI supercomputer to enhance its machine learning capabilities for tasks like fraud detection and taxpayer behavior analysis. From the report: With Elon Musk's so-called Department of Government Efficiency installing itself at the IRS amid a broader push to replace federal bureaucracy with machine-learning software, the tax agency's computing center in Martinsburg, West Virginia, will soon be home to a state-of-the-art Nvidia SuperPod AI computing cluster. According to the previously unreported February 5 acquisition document, the setup will combine 31 separate Nvidia servers, each containing eight of the company's flagship Blackwell processors designed to train and operate artificial intelligence models that power tools like ChatGPT. The hardware has not yet been purchased and installed, nor is a price listed, but SuperPod systems reportedly start at $7 million. The setup described in the contract materials notes that it will include a substantial memory upgrade from Nvidia.

Though small compared to the massive AI-training data centers deployed by companies like OpenAI and Meta, the SuperPod is still a powerful and expensive setup using the most advanced technology offered by Nvidia, whose chips have facilitated the global machine-learning spree. While the hardware can be used in many ways, it's marketed as a turnkey means of creating and querying an AI model. Last year, the MITRE Corporation, a federally funded military R&D lab, acquired a $20 million SuperPod setup to train bespoke AI models for use by government agencies, touting the purchase as a "massive increase in computing power" for the United States.

How exactly the IRS will use its SuperPod is unclear. An agency spokesperson said the IRS had no information to share on the supercomputer purchase, including which presidential administration ordered it. A 2024 report by the Treasury Inspector General for Tax Administration identified 68 different AI-related projects underway at the IRS; the Nvidia cluster is not named among them, though many were redacted. But some clues can be gleaned from the purchase materials. "The IRS requires a robust and scalable infrastructure that can handle complex machine learning (ML) workloads," the document explains. "The Nvidia Super Pod is a critical component of this infrastructure, providing the necessary compute power, storage, and networking capabilities to support the development and deployment of large-scale ML models."

The document notes that the SuperPod will be run by the IRS Research, Applied Analytics, and Statistics division, or RAAS, which leads a variety of data-centric initiatives at the agency. While no specific uses are cited, it states that this division's Compliance Data Warehouse project, which is behind this SuperPod purchase, has previously used machine learning for automated fraud detection, identity theft prevention, and generally gaining a "deeper understanding of the mechanisms that drive taxpayer behavior."

Data Storage

Western Digital Aims For 100TB Hard Drives by 2030 (tomshardware.com) 63

Western Digital plans to introduce its first heat-assisted magnetic recording (HAMR) drives in late 2026, with 36TB conventional magnetic recording (CMR) and 44TB shingled UltraSMR variants. Volume production won't begin until the first half of 2027, following qualification by cloud data center providers in late 2026.

The company projects that HAMR technology, combined with OptiNAND, increased platter count, and mechanical improvements, will enable drives reaching 80TB CMR and 100TB UltraSMR capacities around 2030 -- a departure from Western Digital's previous commitment to microwave-assisted magnetic recording (MAMR) in 2017, which evolved into the energy-assisted perpendicular magnetic recording (ePMR) technology used in current drives.
The Courts

News Orgs Say AI Firm Stole Articles, Spit Out 'Hallucinations' (arstechnica.com) 20

An anonymous reader quotes a report from Ars Technica: Conde Nast and several other media companies sued the AI startup Cohere today, alleging that it engaged in "systematic copyright and trademark infringement" by using news articles to train its large language model. "Without permission or compensation, Cohere uses scraped copies of our articles, through training, real-time use, and in outputs, to power its artificial intelligence ('AI') service, which in turn competes with Publisher offerings and the emerging market for AI licensing," said the lawsuit (PDF) filed in US District Court for the Southern District of New York. "Not content with just stealing our works, Cohere also blatantly manufactures fake pieces and attributes them to us, misleading the public and tarnishing our brands."

Conde Nast, which owns Ars Technica and other publications such as Wired and The New Yorker, was joined in the lawsuit by The Atlantic, Forbes, The Guardian, Insider, the Los Angeles Times, McClatchy, Newsday, The Plain Dealer, Politico, The Republican, the Toronto Star, and Vox Media. The complaint seeks statutory damages of up to $150,000 under the Copyright Act for each infringed work, or an amount based on actual damages and Cohere's profits. It also seeks "actual damages, Cohere's profits, and statutory damages up to the maximum provided by law" for infringement of trademarks and "false designations of origin."

In Exhibit A (PDF), the plaintiffs identified over 4,000 articles in what they called an "illustrative and non-exhaustive list of works that Cohere has infringed." Additional exhibits provide responses to queries (PDF) and "hallucinations" (PDF) that the publishers say infringe upon their copyrights and trademarks. The lawsuit said Cohere "passes off its own hallucinated articles as articles from Publishers."
Cohere said in a statement to Ars: "Cohere strongly stands by its practices for responsibly training its enterprise AI. We have long prioritized controls that mitigate the risk of IP infringement and respect the rights of holders. We would have welcomed a conversation about their specific concerns -- and the opportunity to explain our enterprise-focused approach -- rather than learning about them in a filing. We believe this lawsuit is misguided and frivolous, and expect this matter to be resolved in our favor."

Further reading: Thomson Reuters Wins First Major AI Copyright Case In the US
United States

UK Demand For a Back Door To Apple Data Threatens Americans, Lawmakers Say (msn.com) 94

Members of key congressional oversight committees wrote to the United States' new top intelligence official Thursday to warn that a British order demanding government access to Apple users' encrypted data imperils Americans. From a report: Ron Wyden, a Democrat on the Senate Intelligence Committee, and Andy Biggs, a Republican on the House Judiciary committee, wrote to just-sworn-in National Intelligence Director Tulsi Gabbard and asked her to demand the United Kingdom retract its order.

If the top U.S. ally does not back off, they said, Gabbard should consider limiting the deep intelligence sharing and cooperation on cybersecurity between the countries. The Post first reported the existence of the confidential British order last week. It directs Apple to create a back door into its Advanced Data Protection offering, which allows users to fully encrypt data from iPhones and Mac computers when putting it in Apple's iCloud storage. Apple cannot retrieve such content even when served with a court order, frustrating authorities looking for evidence of terrorism, child abuse and other serious crimes.

The order was issued under the Investigatory Powers Act, which allows the British Home Office to require technical cooperation from companies and forbids those companies from disclosing anything about the demands. It would apply globally, though the U.K. authorities would have to ask Apple for information stored by specific customers.

AI

Thomson Reuters Wins First Major AI Copyright Case In the US 54

An anonymous reader quotes a report from Wired: Thomson Reuters has won the first major AI copyright case in the United States. In 2020, the media and technology conglomerate filed an unprecedentedAI copyright lawsuit against the legal AI startup Ross Intelligence. In the complaint, Thomson Reuters claimed the AI firm reproduced materials from its legal research firm Westlaw. Today, a judge ruled (PDF) in Thomson Reuters' favor, finding that the company's copyright was indeed infringed by Ross Intelligence's actions. "None of Ross's possible defenses holds water. I reject them all," wrote US District Court of Delaware judge Stephanos Bibas, in a summary judgement. [...] Notably, Judge Bibas ruled in Thomson Reuters' favor on the question of fair use.

The fair use doctrine is a key component of how AI companies are seeking to defend themselves against claims that they used copyrighted materials illegally. The idea underpinning fair use is that sometimes it's legally permissible to use copyrighted works without permission -- for example, to create parody works, or in noncommercial research or news production. When determining whether fair use applies, courts use a four-factor test, looking at the reason behind the work, the nature of the work (whether it's poetry, nonfiction, private letters, et cetera), the amount of copyrighted work used, and how the use impacts the market value of the original. Thomson Reuters prevailed on two of the four factors, but Bibas described the fourth as the most important, and ruled that Ross "meant to compete with Westlaw by developing a market substitute."
"If this decision is followed elsewhere, it's really bad for the generative AI companies," says James Grimmelmann, Cornell University professor of digital and internet law.

Chris Mammen, a partner at Womble Bond Dickinson who focuses on intellectual property law, adds: "It puts a finger on the scale towards holding that fair use doesn't apply."
Encryption

UK Orders Apple To Let It Spy on Users' Encrypted Accounts (msn.com) 96

The UK government has ordered Apple to create a backdoor allowing access to encrypted cloud backups of users worldwide, Washington Post reported Friday, citing multiple sources familiar with the matter. The unprecedented demand, issued last month through a technical capability notice under the UK Investigatory Powers Act, requires Apple to provide blanket access to fully encrypted material rather than assistance with specific accounts.

Apple is likely to discontinue its encrypted storage service in the UK rather than compromise user security globally, the report said. The company would still face pressure to provide backdoor access for users in other countries, including the United States. The order was issued under Britain's 2016 Investigatory Powers Act, which makes it illegal to disclose such government demands, according to the report. While Apple can appeal to a secret technical panel and judge, the law requires compliance during any appeal process. The company told Parliament in March that the UK government should not have authority to decide whether global users can access end-to-end encryption.
Science

The Long Quest for Artificial Blood (newyorker.com) 25

Scientists are making significant advances in developing artificial blood substitutes, with two promising approaches emerging in 2025, the New Yorker reports. At the University of Maryland School of Medicine's Center for Blood Oxygen Transport and Hemostasis, researchers are testing ErythroMer, a synthetic nanoparticle that mimics red blood cells' oxygen-carrying capabilities. Simultaneously, the UK's National Health Service is conducting the first human trials of lab-grown blood cells.

These developments address critical blood shortages - of the 38% of Americans eligible to donate, less than 3% do so regularly. Traditional donated blood also has significant limitations: platelets last only 5 days, red blood cells 42 days, and all require careful refrigeration and blood-type matching. DARPA awarded $46 million in early 2023 to develop ErythroMer, seeing potential for battlefield medicine where traditional blood storage isn't feasible.

The synthetic blood can be stored as a powder and reconstituted when needed. There are still a lot of challenges, the report adds. The lab-grown blood currently costs about $75,000 per syringe compared to around $200 for a pint of donated blood, and production is limited to small quantities.
Open Source

RISC-V Mainboard For the Framework Laptop 13 Is Now Available (liliputing.com) 16

The DeepComputing RISC-V Mainboard that Framework announced last year for its 13-inch laptops is now available for $199. Liliputing reports: If you already have a Framework Laptop 13 with an Intel or AMD motherboard, the new board is a drop-in replacement. But if you don't have a Framework Laptop you can also use the mainboard as a standalone computer: Framework sells a $39 Cooler Master case that effectively turns its mainboards into mini desktop computers. The RISC-V Mainboard comes from a partnership between Framework and DeepComputing, the Chinese company behind the DC-ROMA laptops, which were some of the first notebook computers to ship with RISC-V processors.

The board features a StarFive JH7110 processor, which is a 1.5 GHz quad-core chip featuring SiFive U74 RISC-V CPU cores and Imagination BXE-4-32 graphics, 8GB of onboard RAM, and a a 64GB SD card for storage (there's also support for an optional eMMC module, but you'll need to bring your own). Since the board is designed to fit in existing laptop frames, it's the same size and shape as AMD or Intel models and has four USB ports in the same locations. But these ports are a little less versatile than the ones you might find on other Framework Laptop 13 Mainboards [...]. There's also a 3.5mm audio jack.
You can check out the new board via the Framework Marketplace.

Further reading: Late last year, Framework CEO Nirav Patel delivered one of the best live demos we've ever seen at a tech conference -- modifying a Framework Laptop from x86 to RISC-V live on stage.
Power

California Built the World's Largest Solar Power Tower Plant. Now It May Close (latimes.com) 88

"Sometimes, government makes a bad bet..." writes the Los Angeles Times. Opening in 2014, the Ivanpah concentrated solar plant "quickly became known as an expensive, bird-killing eyesore." Assuming that state officials sign off — which they most likely will, because the deal will lead to lower bills for PG&E customers — two of the three towers will shut down come 2026. Ivanpah's owners haven't paid off the project's $1.6-billion federal loan, and it's unclear whether they'll be able to do so. Houston-based NRG Energy, which operates Ivanpah and is a co-owner with Kelvin Energy and Google, said that federal officials took part in the negotiations to close PG&E's towers and that the closure agreement will allow the federal government "to maximize the recovery of its loans." It's possible Ivanpah's third and final tower will close, too. An Edison spokesperson told me the utility is in "ongoing discussions" with the project's owners and the federal government over ending the utility's contract.

It might be tempting to conclude government should stop placing bets and just let the market decide. But if it weren't for taxpayers dollars, large-scale solar farms, which in 2023 produced 17% of California's power, might never have matured into low-cost, reliable electricity sources capable of displacing planet-warming fossil fuels. More than a decade ago, federal loans helped finance some of the nation's first big solar-panel farms.

Not every government investment will be a winner. Renewable energy critics still raise the specter of Solyndra, a solar panel manufacturer that filed for bankruptcy in 2011 after receiving a $535-million federal loan. But on the whole, clean power investments have worked out. The U.S. Department of Energy reported that as of Dec. 31, it had disbursed $40.5 billion in loans. Of that amount, $15.2 billion had already been repaid. The federal government was on the hook for $1.03 billion in estimated losses but had reaped $5.6 billion in interest.

The article notes recent U.S. energy-related loans to a lithium mine in Nevada (close to $1 billion) and $15 billion to expand hydropower, upgrade power lines, and add batteries. Some of the loans won't get paid back "If federal officials are doing their jobs well," the article adds. "That's the risk inherent to betting on early-stage technologies." About the Ivanpah solar towers, they write "Maybe they never should have been built. They're too expensive, they don't work right, they kill too many birds... It's good that their time is coming to an end. But we should take inspiration from them, too: Don't get complacent. Keep trying new things."

PG&E says their objective at the time was partly to "support new technologies," with one senior director of commercial procurement noting "It's not clear in the early stages what technologies will work best and be most affordable for customers. Solar photovoltaic panels and battery energy storage were once unaffordable at large scale." But today they've calculated that ending their power agreements with Ivanpah would cost customers "substantially less." And once deactivated, Ivanpah's units "will be decommissioned, providing an opportunity for the site to potentially be repurposed for renewable PV energy production," NRG said in a statement.

The Las Vegas Review-Journal notes that instead the 3,500-acre, 386-megawatt concentrated thermal power plant used a much older technology, "a system of mirrors to reflect sunlight and generate thermal energy, which is then concentrated to power a steam engine." Throughout the day, 350,000 computer-controlled mirrors track the sunlight and reflect it onto boilers atop 459-foot towers to generate AC. Nowadays, photovoltaic solar has surpassed concentrated solar power and become the dominant choice for renewable, clean energy, being more cost effective and flexible... So many birds have been victims of the plant's concentrated sun rays that workers referred to them as "streamers," for the smoke plume that comes from birds that ignite in midair. When federal wildlife investigators visited the plant around 10 years ago, they reported an average of one "streamer" every two minutes.
"Meanwhile, environmentalists continue to blame the Mojave Desert plant for killing thousands of birds and tortoises," reports the Associated Press. And a Sierra Club campaign organizer also says several rare plant species were destroyed during the plant's construction. "While the Sierra Club strongly supports innovative clean energy solutions and recognizes the urgent need to transition away from fossil fuels, Ivanpah demonstrated that not all renewable technologies are created equal."
Google

Apple Battles For Role in Google Antitrust Trial, Warning of Serious Risks (courtlistener.com) 23

Apple has filed an emergency motion [PDF] for a stay in the Google antitrust trial, warning that it faces "clear and substantial irreparable harm" if barred from participating in the case's remedies phase. The motion, filed on January 30, 2025, comes after Judge Amit Mehta denied Apple's request for limited intervention earlier in the week.

Apple -- which makes more than $20 billion a year from Google to use the Android-maker's search engine on Safari -- argues that the U.S. Department of Justice's (DOJ) proposed remedy -- which includes a prohibition on "any contract between Google and Apple in which there would be anything exchanged of value" --would prevent it from negotiating agreements that benefit millions of users. Without the ability to fully participate, Apple contends it will be left as a "mere spectator" while the government pursues restrictions that directly impact its business interests.

The company asserts that intervention is necessary to develop evidence, participate in discovery, and cross-examine witnesses regarding its market role and incentives. Apple also seeks access to trial records while its appeal is pending, including witness lists, depositions, and discovery materials, to ensure it can respond effectively if granted party status.
Microsoft

Microsoft Slaps $400 Premium on Intel-powered Surface Lineup (theregister.com) 60

Microsoft is charging business customers a $400 premium for Surface devices equipped with Intel's latest Core Ultra processors compared to models using Qualcomm's Arm-based chips, the company has disclosed. The Intel-powered Surface Pro tablet and Surface Laptop, starting at $1,499, come with a second-generation Core Ultra 5 processor featuring eight cores, 16GB of memory and 256GB storage.

Comparable Qualcomm-based models begin at $1,099. The new Intel devices will be available to business customers from February 18, though versions with cellular connectivity will launch later. Consumer Surface devices will only be offered with Qualcomm processors. Microsoft also unveiled a USB 4 Dock supporting dual 4K displays and the Surface Hub 3, a conference room computer available in 50-inch or 85-inch touchscreen versions.
Data Storage

Archivists Work To Identify and Save the Thousands of Datasets Disappearing From Data.gov (404media.co) 70

An anonymous reader quotes a report from 404 Media: Datasets aggregated on data.gov, the largest repository of U.S. government open data on the internet, are being deleted, according to the website's own information. Since Donald Trump was inaugurated as president, more than 2,000 datasets have disappeared from the database. As people in the Data Hoarding and archiving communities have pointed out, on January 21, there were 307,854 datasets on data.gov. As of Thursday, there are 305,564 datasets. Many of the deletions happened immediately after Trump was inaugurated, according to snapshots of the website saved on the Internet Archive's Wayback Machine. Harvard University researcher Jack Cushman has been taking snapshots of Data.gov's datasets both before and after the inauguration, and has worked to create a full archive of the data.

"Some of [the entries link to] actual data," Cushman told 404 Media. "And some of them link to a landing page [where the data is hosted]. And the question is -- when things are disappearing, is it the data it points to that is gone? Or is it just the index to it that's gone?" For example, "National Coral Reef Monitoring Program: Water Temperature Data from Subsurface Temperature Recorders (STRs) deployed at coral reef sites in the Hawaiian Archipelago from 2005 to 2019," a NOAA dataset, can no longer be found on data.gov but can be found on one of NOAA's websites by Googling the title. "Stetson Flower Garden Banks Benthic_Covage Monitoring 1993-2018 -- OBIS Event," another NOAA dataset, can no longer be found on data.gov and also appears to have been deleted from the internet. "Three Dimensional Thermal Model of Newberry Volcano, Oregon," a Department of Energy resource, is no longer available via the Department of Energy but can be found backed up on third-party websites. [...]

Data.gov serves as an aggregator of datasets and research across the entire government, meaning it isn't a single database. This makes it slightly harder to archive than any individual database, according to Mark Phillips, a University of Northern Texas researcher who works on the End of Term Web Archive, a project that archives as much as possible from government websites before a new administration takes over. "Some of this falls into the 'We don't know what we don't know,'" Phillips told 404 Media. "It is very challenging to know exactly what, where, how often it changes, and what is new, gone, or going to move. Saving content from an aggregator like data.gov is a bit more challenging for the End of Term work because often the data is only identified and registered as a metadata record with data.gov but the actual data could live on another website, a state .gov, a university website, cloud provider like Amazon or Microsoft or any other location. This makes the crawling even more difficult."

Phillips said that, for this round of archiving (which the team does every administration change), the project has been crawling government websites since January 2024, and that they have been doing "large-scale crawls with help from our partners at the Internet Archive, Common Crawl, and the University of North Texas. We've worked to collect 100s of terabytes of web content, which includes datasets from domains like data.gov." [...] It is absolutely true that the Trump administration is deleting government data and research and is making it harder to access. But determining what is gone, where it went, whether it's been preserved somewhere, and why it was taken down is a process that is time intensive and going to take a while. "One thing that is clear to me about datasets coming down from data.gov is that when we rely on one place for collecting, hosting, and making available these datasets, we will always have an issue with data disappearing," Phillips said. "Historically the federal government would distribute information to libraries across the country to provide greater access and also a safeguard against loss. That isn't done in the same way for this government data."

Government

OPM Sued Over Privacy Concerns With New Government-Wide Email System (thehill.com) 44

An anonymous reader quotes a report from the Hill: Two federal employees are suing the Office of Personnel Management (OPM) to block the agency from creating a new email distribution system -- an action that comes as the information will reportedly be directed to a former staffer to Elon Musk now at the agency. The suit (PDF), launched by two anonymous federal employees, ties together two events that have alarmed members of the federal workforce and prompted privacy concerns. That includes an unusual email from OPM last Thursday reviewed by The Hill said the agency was testing "a new capability" to reach all federal employees -- a departure from staffers typically being contacted directly by their agency's human resources department.

Also cited in the suit is an anonymous Reddit post Monday from someone purporting to be an OPM employee, saying a new server was installed at their office after a career employee refused to set up a direct line of communication to all federal employees. According to the post, instructions have been given to share responses to the email to OPM chief of staff Amanda Scales, a former employee at Musk's AI company. Federal agencies have separately been directed to send Scales a list of all employees still on their one-year probationary status, and therefore easier to remove from government. The suit says the actions violate the E-Government Act of 2002, which requires a Privacy Impact Assessment before pushing ahead with creation of databases that store personally identifiable information.

Kel McClanahan, executive director of National Security Counselors, a non-profit law firm, noted that OPM has been hacked before and has a duty to protect employees' information. "Because they did that without any indications to the public of how this thing was being managed -- they can't do that for security reasons. They can't do that because they have not given anybody any reason to believe that this server is secure.that this server is storing this information in the proper format that would prevent it from being hacked," he said. McClanahan noted that the emails appear to be an effort to create a master list of federal government employees, as "System of Records Notices" are typically managed by each department. "I think part of the reason -- and this is just my own speculation -- that they're doing this is to try and create that database. And they're trying to sort of create it by smushing together all these other databases and telling everyone who receives the email to respond," he said.

Slashdot Top Deals