28 Apr 15:30

Google goes cold on Europe: Stops making smart thermostats for continental conditions

by Simon Sharwood

And just-about bricks some of its older models everywhere

Google has given up on smart thermostats in Europe.…

28 Apr 15:20

Government says - for now - it will restore international students' status

by Adrian Florido

Johnathan Smith
"He said immigration attorneys have documented close to 5,000 students whose SEVIS records have been terminated in recent weeks. It's unclear whether the government will restore all of them, or only those of students who have sued."

Students and educators attend a rally at Northwestern University one week before the Trump administration said it will restore the records of international students deleted from a crucial database. That move had thrown into doubt many students

International students had filed dozens of lawsuits after the government removed them from a database crucial for maintaining their legal status.

(Image credit: Nam Y. Huh)

28 Apr 15:00

Windows will now let you swear at it — introduces toggle to disable profanity filter for voice typing

Due to popular demand, Microsoft is introducing a profanity filter toggle to Windows 11 Voice Typing.

View attached file (8LLf88JPwnDLEWkjYpKeJK.jpg, image/jpeg)

25 Apr 21:15

Devs sound alarm after Microsoft subtracts C/C++ extension from VS Code forks

by Thomas Claburn

Cursor, Codium makers lose access as add-on goes exclusive

Microsoft's C/C++ extension for Visual Studio Code (VS Code) no longer works with derivative products such as VS Codium and Cursor – and some developers are crying foul.…

24 Apr 22:40

NIH autism study will pull from private medical records

by Alana Wise

Johnathan Smith
How is this legal?

NIH Director Jay Bhattacharya (right), accompanied by HHS Secretary Robert F. Kennedy Jr. (left) and FDA Commissioner Marty Makary (center), speaks during a news conference Tuesday at the Health and Human Services Department on in Washington, D.C.

The National Institutes of Health plans to pool information from private sources like pharmacies and smartwatches.

(Image credit: Andrew Harnik)

24 Apr 22:39

ICE enlists Palantir to develop all-seeing 'ImmigrationOS' eye to speed up deportations

by Brandon Vigliarolo

Only Peter Thiel-backed biz can pull off $30M IT deal, apparently

US Immigration and Customs Enforcement has an urgent need for a new software system to help implement the Trump administration's deportation plans, and it's turning to longtime ICE supplier Palantir for a rush build job.…

24 Apr 15:11

A dozen states sue the Trump administration to stop tariff policy

by The Associated Press

Johnathan Smith
I wonder if a state can arrange to just not have tariffs collected at its ports? Seems just as legal as the imposition to begin with.

An airplane flies over the container ship Alexandra at the Port Newark Container Terminal on April 18 in Newark, N.J.

A dozen states have sued the Trump administration in the U.S. Court of International Trade to stop its tariff policy, challenging Trump's claim that he could arbitrarily impose tariffs based on the International Emergency Economic Powers Act.

(Image credit: Julia Demaree Nikhinson)

24 Apr 14:56

(PR) Bethesda Celebrates Surprise Launch of The Elder Scrolls IV: Oblivion Remastered

by T0@st

Johnathan Smith
Maybe I'll finally play this one.

It's been thrilling for all of us here at Bethesda to revisit a game so close to our hearts with Oblivion Remastered. When we started this project in 2021, we aimed to breathe new life into a chapter of The Elder Scrolls that set the path for so many of our games after it. We never wanted to remake it—but remaster it—where the original game was there as you remember playing it, but seen through today's technology.

We were so fortunate to work with an unbelievable team and partners at Virtuos. Even though we had worked together before, we had never attempted something on this scale. Every piece of art, animation, special effects, and part of the world would be remastered. Some new voices would be recorded, while keeping the originals there as well. Game systems were updated to feel better in your hands. Leveling systems modified for smoother progression and balance. We looked at every part and carefully upgraded it. But most of all, we never wanted to change the core. It's still a game from a previous era and should feel like one.

Read full story

View attached file (imhw6vlFXvmyueLf.jpg, image/jpeg)

20 Apr 19:02

Israeli probe into killing of 15 Palestinian medics finds 'professional failures'

by The Associated Press

Mourners gather around the bodies of 8 Red Crescent emergency responders, recovered in Rafah a week after an Israeli attack, as they are transported for burial from a hospital in Deir al-Balah, Gaza Strip, on Monday, March 31, 2025.

The Israeli military investigation said the examination found "no evidence to support claims of execution or that any of the deceased were bound before or after the shooting."

(Image credit: Abdel Kareem Hana)

19 Apr 15:33

Krebs throws himself on the grenade, resigns from SentinelOne after Trump revokes clearances

by Jessica Lyons

Illegitimi non carborundum? Nice password, Mr Ex-CISA

Chris Krebs, the former head of the US Cybersecurity and Infrastructure Security Agency (CISA) and a longtime Trump target, has resigned from SentinelOne following a recent executive order that targeted him and revoked the security clearances of everybody at the company.…

19 Apr 15:23

CVE program gets last-minute funding from CISA – and maybe a new home

by Jessica Lyons

Johnathan Smith
This can't cost very much compared to what it provides. I mean just lump funding this in with 'national security'...

Uncertainty is the new certainty

In an 11th-hour reprieve, the US government last night agreed to continue funding the globally used Common Vulnerabilities and Exposures (CVE) program.…

19 Apr 15:10

Dot com era crash on the cards for AI datacenter spending? It's a 'risk'

by Paul Kunert

Analysts say the bubble won't burst, but it is possible, admits world's largest colo provider

Interview Those who ignore history are destined to repeat mistakes of the past and, with signs of an inflating bit barn spending bubble, comparisons are being made with the infamous dotcom bust a quarter of a century ago.…

17 Apr 15:03

Intel's 18A Node Outperforms TSMC N2 and Samsung SF2 in 2 nm Performance Class

by AleksandarK

Intel's 18A node isn't all about yields and density (which are still very important factors) but also performance. According to Taiwanese media 3C News, citing TechInsights research and calculations, the new leader of node performance is Intel 18A. On a custom scale used by TechInsights, Intel 18A gets a 2.53 score, while the performance score of TSMC N2 is 2.27, and the performance score of Samsung SF2 is 2.19. This is all among two nm-class nodes, where Intel leads the category. Being the first node with a Backside Power Delivery Network (BSPDN), it will appear in the Panther Lake CPUs in late 2025 for testing and early 2026 for shipments. This new power architecture boosts layout efficiency and component utilization by 5-10%, lowers interconnect resistance, and enhances ISO power performance by up to 4%, thanks to a significant drop in intrinsic resistance versus traditional front‑end power routing. Relative to its predecessor, Intel 3, the 18A process delivers a 15% improvement in performance per watt and packs 30% more transistors into the same area.

Featuring RibbonFET design, it has entered risk production. According to Intel, "This final stage is about stress-testing volume manufacturing before scaling up to high volume in the second half of 2025." When it comes to other aspects like SRAM density, high‑performance SRAM cells shrank from 0.03 µm² in Intel 3 to 0.023 µm² in Intel 18A, while high‑density cells contracted to 0.021 µm², reflecting scaling factors of 0.77 and 0.88 respectively and defying previous assumptions that SRAM scaling had plateaued. Intel's innovative "around‑the‑array" PowerVia approach addresses voltage drops and interference by routing power vias to I/O, control, and decoder circuits, freeing up the bit‑cell area from frontal power supplies. The result is a 38.1 Mbit/mm² macro bit density, positioning Intel to rival TSMC's N2. All this, combined with BSPDN, is shaping up a powerful node. We can't wait to get our hands on some 18A silicon in the future and run it through our labs for testing.

View attached file (cMv53rwqInjRngZv.jpg, image/jpeg)

17 Apr 14:55

Unpowered SSD endurance investigation finds severe data loss and performance issues

A year-two update on the How long can SSDs store data unpowered video series is another reminder about the importance of regular backups.

View attached file (W7vXqDuaaPJa4CV6hZQyGV.jpg, image/jpeg)

17 Apr 14:40

'Homegrowns are next': Trump hopes to deport and jail U.S. citizens abroad

by Brian Mann

Johnathan Smith
Seems like this is crossing a line even for republicans since none of them or their thinktanks would talk about it to NPR. Only the Cato institute would say something and it wasn't supportive at all.

U.S. President Donald Trump met with President Nayib Bukele of El Salvador in the Oval Office of the White House April 14, 2025 in Washington, D.C. Trump and Bukele are exploring a proposal to send American citizens to be held in Salvadorean prisons, a move critics describe as dangerous and unconstitutional.

Trump hopes to deport and imprison U.S. citizens abroad. Critics say the concept is unconstitutional and dangerous.

(Image credit: Win McNamee)

16 Apr 15:16

Trump administration freezes more than $2.2 billion after Harvard rejects its demands

by Jonaki Mehta

Harvard University has refused to make changes in hiring, admissions and DEI programs.

The government announced it is freezing more than $2.2 billion, hours after the university refused to make changes it said would "dictate what private universities can teach."

(Image credit: Brian Snyder)

12 Apr 22:10

China put steep tariffs on U.S. exports. Farmers are worried

by Danielle Kurtzleben

Secretary of Agriculture Brooke Rollins speaks with reporters outside the White House on March 26, 2025.

The first Trump administration spent $28 billion bailing out farmers during a trade war with China. The White House has said it's starting to look at how to help this time around.

(Image credit: Win McNamee)

12 Apr 21:55

Trump exempts computers, smartphones, and more from tariffs

The tech industry is delighted that computers, smartphones, semiconductor chips, and more are officially exempt from tariffs on foreign imports.

View attached file (kbNTzpNk6QYUEDpAuxQqFZ.jpg, image/jpeg)

11 Apr 15:14

Beijing slaps 125% tariffs on U.S. goods in latest U.S.-China trade escalation

by Sherisse Pham

Johnathan Smith
I wonder how many businesses in the US will be obliterated by this. Shipping between the two must be going from a huge amount to basically zero very soon. This is all so capricious and dumb to start a trade war.

People stand outside a brokerage house as an electronic board displays shares trading index in the Central Business District, in Beijing on Thursday.

China signals the latest tariff hike will be its last round of tit-for-tat measures, prompting sharp falls in European shares, as Asian stocks end the day mixed.

(Image credit: Andy Wong)

11 Apr 15:03

The sound of Windows 95 about to disappoint you added to Library of Congress significant sound archive

by Simon Sharwood

Along with Celine Dion and Elton John - plus some good music too

The Brian-Eno-composed sound played by Windows 95 when booted has been added to the US Library of Congress’s list of nationally significant recordings.…

11 Apr 15:01

Ex-Meta exec tells Senate Zuck dangled US citizen data in bid to enter China

by Iain Thomson

Former policy boss claims Facebook cared little about national security as it chased the mighty Yuan

Facebook's former director of global public policy told a Senate committee that Meta CEO Mark Zuckerberg was willing to do almost anything to get the social network into China - including, she alleged, offering up Americans' data.…

10 Apr 15:05

DOGE dilettantes 'didn't test' Social Security fraud detection tool at appropriate scale

by Iain Thomson

Feds claim creaky COBOL, user spike is real reason key portal now flaky

The United States Social Security Administration's internet portal has frequently gone offline in recent weeks, and presented inaccurate or incomplete information to users, perhaps because of changes steered by Elon Musk's cost-trimming DOGE unit.…

09 Apr 15:07

TSMC Faces $1 Billion Fine from US Government Over Shipments to Huawei

by AleksandarK

TSMC is confronting a potential $1 billion-plus penalty from the US Commerce Department after inadvertently fabricating compute chiplets later integrated into Huawei's Ascend 910 AI processor. The fine, potentially reaching twice the value of unauthorized shipments, reflects the scale of components that circumvented export controls limiting Chinese access to advanced semiconductor technology. The regulatory breach originated in late 2023 when TSMC processed orders from Sophgo, a design partner of crypto-mining firm Bitmain. These chiplets, which are manufactured on advanced process nodes and contain tens of billions of transistors, were identified in TechInsights teardown analysis of Huawei Ascend 910 AI accelerator, revealing a supply chain vulnerability where TSMC lacked visibility into the components' end-use.

Upon discovery of the diversion, TSMC immediately halted Sophgo shipments and engaged in discussions with Commerce Department officials. By January, Sophgo had been added to the Entity List, limiting its access to US semiconductor technology. A Center for Strategic and International Studies report revealed that Huawei obtained approximately two million Ascend 910B logic dies through shell companies that misled TSMC. Huawei's preference for TSMC-made dies was due to manufacturing challenges in domestic chip production. This incident has forced TSMC to strengthen its customer vetting protocols, including terminating its relationship with Singapore-based PowerAIR following internal compliance reviews. The enforcement process typically begins with a proposed charging letter detailing violations and penalty calculations, followed by a 30-day response period. As Washington tightens restrictions on AI processor exports to Chinese entities, semiconductor manufacturers are under increased pressure to implement rigorous controls throughout multinational supply chains.

View attached file (5PrfHE4eo0bUUnmv.jpg, image/jpeg)

08 Apr 15:21

Please sir, may we have some Moore? Doesn't look that way

by Rupert Goodwins

Johnathan Smith
Reminds me of how I'm currently working with some traditional software guys to do some real time embedded programming. They went from "this is impossible" to "its in real time" with some guidance over about a month. Even starting in C++ and using some fancy libraries they had >20x performance improvement still laying on the table. I don't know how the software community fixes their collective performance problem but I know my use case isn't unique.

We're on a roadmap to nowhere. Come on inside

Opinion Nvidia has just shown off its vision of the near future in the shape of its Blackwell Ultra. Aptly for a company that helps gamers explore dystopian science-fiction hellscapes, Nvidia's actual future involves vast, heat-soaked stacks of silicon, guzzling energy by the half-gigawatt.…

08 Apr 14:50

U.S. Gov't eliminates tape data storage at the GSA to save $1M per year, but tape isn't dead yet

Johnathan Smith
Another stupid decision that will cost money long term.

But the triumphant social media post gets a Community Note.

View attached file (TKmiN4ZDMCxVcBexCi7w4a.jpg, image/jpeg)

07 Apr 20:47

Looking Ahead at Intel’s Xe3 GPU Architecture

by Chester Lam

Johnathan Smith
The most interesting thing is the commenter that breaks down exactly how Intel is losing money on the existing Xe2 series cards.

Intel’s foray into high performance graphics has enjoyed impressive progress over the past few years, and the company is not letting up on the gas. Tom Peterson from Intel has indicated that Xe3 hardware design is complete, and software work is underway. Some of that software work is visible across several different open source repositories, offering a preview of what’s to come.

GPU Organization: Larger Render Slices?

Modern GPUs are built from a hierarchy of subdivision levels, letting them scale to hit different performance, power and price targets. A shader program running on an Intel GPU can check where it’s running by reading the low bits of the sr0 (state register 0) architectural register.

sr0 topology bits on Xe3 have a different layout¹. Xe Cores within a Render Slice are enumerated with four bits, up from two in prior generations. Thus Xe3’s topology bits would be able to handle a Render Slice with up to 16 Xe Cores. Prior Xe generations could only have four Xe Cores per Render Slice, and often went right up to that. The B580 and A770 both placed four Xe Cores in each Render Slice.

Having enough bits to describe a certain configuration doesn’t mean Intel will ship something that big. Xe did use its maximum 32 core, 4096 lane setup in the Arc A770. However, Xe2 maxed out at 20 cores and 2560 lanes with the Arc B580. Xe2’s sr0 format could theoretically enumerate 16 slices. Giving each slice the maximum of 4 Xe Cores would make a 64 Xe Core GPU with 8192 FP32 lanes. Obviously the B580 doesn’t get anywhere near that.

Visualizing the shader array on a hypothetical giant Xe2 implementation that maxes out all topology enumeration bits

Xe3 goes even further. Maxing out all the topology enumeration bits would result in a ludicrously large 256 Xe Core configuration with 32768 FP32 lanes. That’s even larger than Nvidia’s RTX 5090, which “only” has 21760 FP32 lanes. Intel has been focusing on the midrange segment for a while, and I doubt we’ll see anything that big.

Instead, I think Intel wants more flexibility to scale compute power independently of fixed function hardware like ROPs and rasterizers. AMD and Nvidia’s SAs and GPCs all pack a lot more than four cores. For example, the RX 6900XT’s Shader Engines each have 10 WGPs. Nvidia’s RTX 4090 puts eight SMs in each GPC. GPUs have become more compute-heavy over time, as games use more complex shader programs. Intel seems to be following the same trend.

XVE Changes

Xe Vector Engines (XVEs) execute shader programs on Intel GPUs. They use a combination of vector-level and thread-level parallelism to hide latency.

Higher Occupancy, Increased Parallelism

Xe3 XVEs can run 10 threads concurrently, up from eight in prior generations. Like SMT on a CPU, tracking multiple threads helps a XVE hide latency using thread level parallelism. If one thread stalls, the XVE can hopefully find an un-stalled thread to issue instructions from. Active thread count is also referred to as thread occupancy. 100% occupancy on a GPU would be analogous to 100% utilization in Windows Task Manager. Unlike CPU SMT implementations, GPU occupancy can be limited by register file capacity.

Prior Intel GPUs had two register allocation modes. Normally each thread gets 128 512-bit registers, for 8 KB of registers per thread. A “large GRF” mode gives each thread 256 registers, but drops occupancy to 4 threads because of register file capacity limits. Xe3 continues to use 64 KB register files per XVE, but flexibly allocates registers in 32 entry blocks². That lets Xe3’s XVEs get 10 threads in flight as long as each thread uses 96 or fewer registers. If a shader program needs a lot of registers, occupancy degrades more gracefully than in prior generations.

Nvidia and AMD GPUs allocate registers at even finer granularity. AMD’s RDNA 2 for example allocates registers in blocks of 16. But Xe3 is still more flexible than prior Intel generations. With this change, simple shaders that only need a few registers will enjoy better latency tolerance from more thread-level parallelism. And more complex shaders can avoid dropping to the “large GRF” mode.

Xe3’s XVEs have more scoreboard tokens too. Like AMD and Nvidia, Intel uses compiler assisted scheduling for long latency instructions like memory accesses. A long latency instruction can set a scoreboard entry, and a dependent instruction can wait until that entry is cleared. Each Xe3 thread gets 32 scoreboard tokens regardless of occupancy, so a XVE has 320 scoreboard tokens in total. On Xe2, a thread gets 16 tokens if the XVE is running eight threads, or 32 in “large GRF” mode with four threads. Thus Xe2’s XVEs only have 128 scoreboard tokens in total. More tokens let thread have more outstanding long latency instructions. That very likely translates to more memory level parallelism per thread.

“Scalar” Register (s0)

Intel’s GPU ISA has a vector register file (GRF, or General Register File) that stores much of a shader program’s data and feeds the vector execution units. It also has an “Architecture Register File” (ARF) with special registers. Some of those can store data, like the accumulator registers. But others serve special purposes. For example, sr0 as mentioned above provides GPU topology info, along with floating point exception state and thread priority. A 32-bit instruction pointer points to the current instruction address, relative to the instruction base address.

Notes on Intel’s ARF, with changes from Xe2 to Xe3 in blue

Xe3 adds a “Scalar Register” (s0) to the ARF⁶. s0 is laid out much like the address register (a0), and is used for gather-send instructions. XVEs access memory and communicate with other shared using by sending messages over the Xe Core’s message fabric, using send instructions. Gather-send appears to let Xe3 gather non-contiguous values from the register file, and send them with a single send instruction.

Besides adding the Scalar Register, Xe3 extends the thread dependency register (TDR) to handle 10 threads. sr0 gains an extra 32-bit doubleword for unknown reasons.

Instruction Changes

Xe3 supports a saturation modifier for FCVT, an instruction that converts between different floating point types (not between integer and floating point). FCVT was introduced with Ponte Vecchio, but the saturation modifier could ease conversion from higher to lower precision floating point formats. Xe3 also gains HF8 (half float 8-bit) format support, providing another 8-bit floating point format option next to the BF8 type already supported in Xe2.

For the XMX unit, Xe3 gains a xdpas instruction⁴. sdpas stands for sparse systolic dot product with accumulate⁵. Matrices with a lot of zero elements are known as sparse matrices. Operations on sparse matrices can be optimized because anything multiplied by zero is obviously zero. Nvidia and AMD GPUs have both implemented sparsity optimizations, and Intel is apparently looking to do the same.

Raytracing: Sub-Triangle Opacity Culling

Sub-Triangle Opacity Culling (STOC) subdivides triangles in BVH leaf nodes, and marks sub-triangles as transparent, opaque, or partially transparent. The primary motivation is to reduce wasted any-hit shader work when games use texture alpha channels to handle complex geometry. Intel’s paper calls out foliage as an example, noting that programmers may use low vertex counts to reduce “rendering, animation, and even simulation run times.”⁷ BVH geometry from the API perspective can only be completely transparent or opaque, so games mark all partially transparent primitives as transparent. Each ray intersection will fire an any-hit shader, which carries out alpha testing. If alpha testing indicates the ray intersected a transparent part of the primitive, the shader program doesn’t contribute a sample and the any-hit shader launch is basically wasted. STOC bits let the any-hit shader skip alpha testing if the ray intersects a completely transparent or completely opaque sub-triangle.

From Intel's paper, showing examples of foliage textures

Storing each sub-triangle’s opacity information takes two bits, so STOC does require more storage compared to using a single opacity bit for the entire triangle. Still, it’s far more practical than packing entire textures into the BVH. Intel’s paper found that a software-only STOC implementation improved performance by 5.9-42.2% compared to standard alpha tests when handling translucent ray-traced shadows.

Elden Ring's BVH represents tree leaves in Leyndell using larger triangles, as seen in Radeon Raytracing Analyzer. STOC may map well to this scenario

STOC-aware raytracing hardware can provide further gains, especially with Intel's raytracing implementation. Intel's raytracing acceleration method closely aligns with the DXR 1.0 standard. A raytracing accelerator (RTA) autonomously handles traversal and launches hit/miss shaders by sending messages to the Xe Core's thread dispatcher. STOC bits could let the RTA skip shader launches if the ray intersects a completely transparent sub-triangle. For an opaque sub-triangle, the RTA can tell the shader program to skip alpha testing, and terminate the ray early.

Illustrating the problem STOC tries to solve. Yes I used rectangles, but I wanted to use the same leaf texture outline as Intel's paper. And Intel's leaf nodes store rectangles (triangle pairs) anyway

Xe3 brings STOC bits into hardware raytracing data structures with two levels of sophistication. A basic implementation retains 64B leaf nodes, but creatively finds space to fit 18 extra bits. Intel's QuadLeaf structure represents a merged pair of triangles. Each triangle gets 8 STOC bits, implying four sub-triangles. Another two bits indicate whether the any-hit shader should do STOC emulation in software, potentially letting programmers turn off hardware STOC for debugging. This mode is named "STOC1" in code.

Sketching out triangle (leaf) node formats for Xe/Xe2 and Xe3. Blue = STOC related, purple = non-STOC raytracing data structure changes

A “STOC3” structure takes things further by storing pointers to STOC bits rather than embedding them into the BVH. That allows more flexibility in how much storage the STOC bits can use. STOC3 also specifies recursion levels for STOC bits, possibly for recursively partitioning triangles. Subdividing further would reduce the number of partially transparent sub-triangles, which require alpha testing from the any-hit shader. Storing pointers for STOC3 brings leaf node size to 128 bytes, increasing BVH memory footprint.

Possible performance gains are exciting, but using STOC requires work from game developers or game engines. Intel suggests that STOC bits can be generated offline as part of game asset compilation. Artists will have to determine whether using STOC will provide a performance uplift for a particular scene. A scene with a lot of foliage might benefit massively from STOC. A chain link fence may be another story. STOC isn’t a part of the DirectX or Vulkan standards, which can be another obstacle to adoption. However, software-only STOC can still provide benefits. That could encourage developers to try it out. If they do implement it, STOC-aware Xe3 hardware stands to gain more than a software-only solution.

Final Words

We’re still some time away from real Xe3 products. But software changes suggest Xe3 is another significant step forward for Intel’s graphics architecture. Xe2 was a solid step in Intel’s foray into discrete graphics, providing better performance than Xe with a nominally smaller GPU. Xe3 tweaks the architecture again and likely has similar goals. Higher occupancy and dynamic register allocation would make Xe Cores more latency tolerant, improving utilization. Those changes also bring Intel’s graphics architecture closer to AMD and Nvidia’s.

XVE changes show Intel is still busy evolving their core compute architecture. In contrast, Nvidia’s Streaming Multiprocessors haven’t seen significant changes from Ampere to Blackwell. Nvidia may have felt Ampere’s SM architecture was good enough, and turned their efforts to tuning features while scaling up the GPU to keep providing generational gains. Intel meanwhile seeks to get more out of each Xe Core (and Xe2 achieved higher performance than Xe with fewer Xe Cores).

In a similarity with Nvidia, Intel is pushing hard on the features front and evidently has invested into research. GPUs often try to avoid doing wasted work. Just rasterization pipelines use early depth testing to avoid useless pixel shader invocations, STOC avoids spawning useless any-hit shaders. It’s too early to tell what kind of difference STOC or other Xe3 features will make. But anyone doubting Intel’s commitment to moving their GPU architecture forward should take a serious look at Mesa and Intel Graphics Compiler changes. There’s a lot going on, and I look forward to seeing Xe3 whenever it’s ready.

View attached file (https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F12874222-8008-4367-9262-7a02626cb3b6_810x576.jpeg, image/jpeg)

07 Apr 20:42

Dev loudly complained about older colleague, who retired not long after

by Simon Sharwood

The graybeard wasn't doing a great job and morale improved once he left. How would you handle this?

Who, Me? It's hard to confront the start of a working week, but each Monday morning, The Register tries to keep the weekend fun going for another few minutes by delivering a fresh edition of Who, Me? It's the column in which we take your tales of your most ticklish moments at work and share them for the amusement and/or education of your fellow readers.…

07 Apr 14:53

Meta debuts its first 'mixture of experts' models from the Llama 4 herd

by Simon Sharwood

Johnathan Smith
"Meta also claimed it has fixed the tendency for large language models to deliver results that align with left wing political thought.

"It's well-known that all leading LLMs have had issues with bias — specifically, they historically have leaned left when it comes to debated political and social topics," Meta's launch post states, before attributing that to "the types of training data available on the internet.""

Says they’re done right as they don’t lean so far left

Meta has debuted the first two models in its Llama 4 family, its first to use mixture of experts tech.…

07 Apr 14:40

Asia GPMI vs HDMI: Comparing Bandwidth and Power Delivery

Johnathan Smith
Coming straight for hdmi and usb it seems.

A group of over 50 Chinese companies is developing a new standard called the General Purpose Media Interface (GPMI). This new tech will handle 192 Gbit/s of bandwidth and can deliver up to 480W of power.

07 Apr 04:06

Asian markets plunge with Japan's Nikkei diving nearly 8% after big Wall St. meltdown

by The Associated Press

Currency traders work near a screen showing the Korea Composite Stock Price Index (KOSPI), top left, and the foreign exchange rate between U.S. dollar and South Korean won, top center, at the foreign exchange dealing room of the KEB Hana Bank headquarters in Seoul, South Korea, Monday, April 7, 2025.

Asian shares nosedived on Monday after the meltdown Friday on Wall Street over U.S. President Donald Trump's tariff hikes and the backlash from Beijing.

(Image credit: Ahn Young-joon)

Johnathan Smith

Shared posts

And just-about bricks some of its older models everywhere

Cursor, Codium makers lose access as add-on goes exclusive

Only Peter Thiel-backed biz can pull off $30M IT deal, apparently

Illegitimi non carborundum? Nice password, Mr Ex-CISA

Uncertainty is the new certainty

Analysts say the bubble won't burst, but it is possible, admits world's largest colo provider

Along with Celine Dion and Elton John - plus some good music too

Former policy boss claims Facebook cared little about national security as it chased the mighty Yuan

Feds claim creaky COBOL, user spike is real reason key portal now flaky

We're on a roadmap to nowhere. Come on inside

GPU Organization: Larger Render Slices?

XVE Changes

Higher Occupancy, Increased Parallelism

“Scalar” Register (s0)

Instruction Changes

Raytracing: Sub-Triangle Opacity Culling

Final Words

The graybeard wasn't doing a great job and morale improved once he left. How would you handle this?

Says they’re done right as they don’t lean so far left