Arm Announces New Ethos-N57 and N37 NPUs, Mali-G57 Valhall GPU and Mali-D37 DPU
by Andrei Frumusanu on October 23, 2019 12:00 AM EST
Today Arm is announcing four new products in its NPU, GPU and DPU portfolio. The company is branding its in-house machine learning processor IPs as the Ethos line-up, detailing more about the existing N77 and also revealing the smaller N57 and N37 siblings in the family. To top things off, the company is also readying its first mid-range GPU IP based on the brand-new Valhall architecture, the new Mali-G57. Finally, we’re seeing the release of a new mid-range DPU in the form of the Mali-D37.
Introducing the Ethos NPU Family
Arm’s NPU IP offering was first announced early last year, with its architecture detailed a few months later, and has until now been publicly known just as “the Arm Machine Learning processor”. At TechCon this year, Arm has officially branded the IP as the Ethos line-up, with the N77 being the main product that had previously been referred to as the Arm MLP.
Microarchitecturally, the newly branded Ethos-N77 publicly changes its specs compared to what was revealed last year: it now allows for a configurable 1 to 4MB SRAM implementation, whereas last year it had been disclosed that it would scale up to 1MB only. Arm explains that customers needed more memory bandwidth for processing on these mesh-networked NPUs, as DRAM bandwidth doesn’t scale up in the premium segment as fast as the core count does. The flagship IP offers up to 4TOPS of processing power at a 1GHz clock and has a respectable 5TOPS/W efficiency.
Arm is able to use the same building blocks across the different IPs. The NPUs all share the same MAC computation engine (MCE) and programmable layer engine (PLE). The MCE consists of 128 MAC units, as disclosed last year, and is paired with a PLE. An MCE and a PLE, plus SRAM, make up a computation engine (CE), and the CE count is what differs between the N77, N57 and N37, which come in 16x, 8x and 4x configurations respectively.
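As a rough sanity check, the figures quoted above can be combined into a peak-throughput estimate. The sketch below is our own back-of-the-envelope arithmetic, not an Arm formula: it assumes one MCE per CE, counts each MAC as two operations per cycle (a common convention, but an assumption here), and assumes the 4TOPS and 5TOPS/W figures describe the same operating point. The names `peak_tops` and `implied_power_w` are hypothetical helpers.

```python
# Back-of-the-envelope estimate built only from figures quoted in the article.
MACS_PER_MCE = 128         # from the article: 128 MAC units per MCE
OPS_PER_MAC = 2            # assumption: one multiply plus one accumulate per cycle
CLOCK_HZ = 1_000_000_000   # from the article: 1GHz clock

# From the article: 16x, 8x and 4x CE configurations.
CE_COUNT = {"Ethos-N77": 16, "Ethos-N57": 8, "Ethos-N37": 4}


def peak_tops(ce_count, clock_hz=CLOCK_HZ):
    """Peak tera-operations per second, assuming one MCE per CE."""
    return ce_count * MACS_PER_MCE * OPS_PER_MAC * clock_hz / 1e12


def implied_power_w(tops, tops_per_watt):
    """Power implied by a throughput figure and an efficiency figure."""
    return tops / tops_per_watt


if __name__ == "__main__":
    for name, ces in CE_COUNT.items():
        print(f"{name}: {peak_tops(ces):.3f} TOPS peak (estimate)")
    # Assumption: the quoted 4TOPS and 5TOPS/W apply at the same operating point.
    print(f"Implied N77 power: {implied_power_w(4.0, 5.0):.1f} W")
```

Under these assumptions the 16-CE configuration works out to 4.096 TOPS, in line with the quoted "up to 4TOPS", and the smaller configurations scale linearly with CE count. Real-world throughput will depend on utilization and memory bandwidth, which this estimate ignores.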
The mid-range and low-end variants are being released a lot faster than is the case with other new IP technologies, because Arm is seeing a lot more interest in doing ML in cost-constrained devices where every mm² of silicon is important. In particular, features such as smartphone face unlocking or DTV resolution upscaling are becoming commodity features.
The new NPUs have already been licensed and delivered to customers.
Revealing the Mali-G57 - First Mid-range Valhall Based GPU
Earlier this year, Arm announced the new Valhall architecture in the new Mali-G77 that we’re expecting to see in SoCs next year. The new GPU architecture is a major departure from the Bifrost-based GPUs we’ve seen over the last three years, as Arm has completely revamped its graphics ISA and computation microarchitecture.
Today, Arm reveals that the company is adopting the new Valhall architecture in the mid-range, starting off with the new Mali-G57. We currently don’t have many details on exactly what the finer microarchitectural configuration of the new GPU looks like, but we’re very likely looking at something very similar to the G77, scaled down in the same way the G52 was scaled down from the G72.
Compared to a G52 with three execution engines per core (3EE), Arm promises 1.3x better performance in a similar core configuration, 30% better energy efficiency, and 30% better silicon density (due to the better performance).
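To see how these three claims could fit together, here is a small sketch of the ratios involved. It rests on our own reading, not on anything Arm has confirmed: that "energy efficiency" means performance per watt, that "silicon density" means performance per unit area, that the comparison is at roughly equal die area, and that all figures apply at the same operating point. The function names are hypothetical.

```python
# Relative G57-vs-G52 (3EE) ratios, using the figures quoted in the article.
PERF_RATIO = 1.3         # from the article: 1.3x better performance
EFFICIENCY_RATIO = 1.3   # from the article: 30% better energy efficiency
AREA_RATIO = 1.0         # assumption: roughly equal area in a similar configuration


def relative_power(perf_ratio, efficiency_ratio):
    """Power ratio, assuming efficiency is performance per watt."""
    return perf_ratio / efficiency_ratio


def relative_density(perf_ratio, area_ratio):
    """Performance-per-area ratio, assuming density means perf per mm²."""
    return perf_ratio / area_ratio


if __name__ == "__main__":
    print(f"Relative power:   {relative_power(PERF_RATIO, EFFICIENCY_RATIO):.2f}x")
    print(f"Relative density: {relative_density(PERF_RATIO, AREA_RATIO):.2f}x")
```

Read this way, the 30% density gain follows directly from 1.3x performance at equal area, and the power draw stays roughly flat, though the actual measurement conditions behind Arm's numbers are not stated.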
Mali-D37 DPU - Bringing High-End Features To the Mid-Range
Finally, to wrap things up, Arm is now bringing to market a new mid-range DPU in the form of the Mali-D37.
The new IP is based on the “Komeda” architecture which was first introduced in the Mali-D71 and its follow-up, the Mali-D77 announced this year. The new DPU targets resolutions of 2K and FHD and promises to take up less than 1mm² on 16nm.
Related Reading
- ARM Details "Project Trillium" Machine Learning Processor Architecture
- Imagination Goes Further Down the AI Rabbit Hole, Unveils PowerVR Series3NX Neural Network Accelerator
- CEVA Announces NeuPro-S Second-Generation NN IP
- Cadence Announces Tensilica Vision Q7 DS
- Arm's New Mali-G77 & Valhall GPU Architecture: A Major Leap
- Arm Announces Mali D77 Display Processor: Facilitating AR & VR
- Arm Announces New Mali-D71 Display Processor and IP Blocks
12 Comments
RallJ - Wednesday, October 23, 2019
That 4.8 TOPs/w is for the entire SoC, meaning the NPUs are much more efficient than anything announce here. Also they operate at a much higher power and it's easier to be more efficient at the lower power range.
name99 - Thursday, October 24, 2019
Sure, sure. Interesting that you chose to attack the power efficiency (on technical grounds), rather than the shipping dates. Probably a wise choice...