AI Chips

Uniquely designed AI chips
optimized for faster, more efficient
training and inference

AI development and usage currently depend heavily on general-purpose GPUs, but the rapid rise of generative AI is pushing these chips to their limits in performance, cost, power efficiency and availability. To sustain AI’s evolution and practical implementation, specialized semiconductors optimized for AI computing are essential.

Since 2016, PFN has been developing the MN-Core™ processor series with Kobe University. These AI-dedicated chips deliver high speed and efficiency for both training and inference. Designed from the ground up with an architecture completely different from that of general-purpose GPUs, the MN-Core series specializes exclusively in the computations required for AI, achieving outstanding processing performance and efficiency.

Processor for generative AI inference
MN-Core L1000

As the first model in the MN-Core L Series, the MN-Core L1000 is a processor under development designed specifically for generative AI inference. Unlike conventional processors that place memory and logic side by side, the L1000 adopts a 3D-stacked architecture that vertically stacks memory and logic. This structure enables significantly greater memory bandwidth than the high-bandwidth memory (HBM) used in current high-end GPUs.

In addition, while many recent AI processors rely on SRAM (static random access memory), the L1000 employs large-capacity, cost-efficient DRAM, achieving both high speed and large memory capacity at lower cost. As a result, the L1000 can process more tokens per second during generative AI inference, delivering up to 10 times the speed of existing GPUs and other processors.
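The link between memory bandwidth and token throughput can be illustrated with a common first-order estimate: in single-stream decoding, generating each token requires reading roughly all model weights from memory once, so the token rate is bounded by bandwidth divided by model size in bytes. The sketch below applies that estimate. The function name and every number in it are placeholders chosen for illustration, not L1000 or GPU specifications, and the model ignores KV-cache traffic, batching and compute limits.

```python
def max_decode_tokens_per_second(bandwidth_gb_s, params_billion, bytes_per_param):
    """Upper bound on single-stream decode rate when weight reads dominate."""
    # 1e9 parameters times bytes per parameter gives the model size in GB.
    model_gb = params_billion * bytes_per_param
    return bandwidth_gb_s / model_gb

# Placeholder values for illustration only, NOT published specifications.
baseline = max_decode_tokens_per_second(1000, 10, 1)  # 100.0 tokens/s
stacked = max_decode_tokens_per_second(4000, 10, 1)   # 400.0 tokens/s

print(f"baseline: {baseline:.0f} tokens/s, higher bandwidth: {stacked:.0f} tokens/s")
```

Under this simplified model, throughput scales linearly with memory bandwidth, which is why a memory architecture with more bandwidth can translate directly into more tokens per second.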

MN-Core L1000, a processor for generative AI inference (mockup)

AI training and high-performance computing processor
MN-Core 2 (second generation)

MN-Core 2 is the second-generation model in the series and offers top-class energy efficiency. Compared with the first-generation MN-Core, it has greater memory bandwidth and a smaller footprint, allowing high-density placement in compact blades.

Precision	MN-Core 2 (theoretical performance)	MN-Core 2 (power efficiency)
FP64	12 TFLOPS	37.24 GFLOPS/W
FP32	49 TFLOPS	148.9 GFLOPS/W
TF32	98 TFLOPS	297.9 GFLOPS/W
TF16	393 TFLOPS	1,192 GFLOPS/W
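As a quick sanity check on the table, dividing each performance figure by its power efficiency gives the power draw that the row implies. The sketch below does that arithmetic. The resulting wattage is derived here from the published figures and is not an official specification.

```python
# Performance (TFLOPS) and power efficiency (GFLOPS/W) copied from the table.
MN_CORE_2 = {
    "FP64": (12, 37.24),
    "FP32": (49, 148.9),
    "TF32": (98, 297.9),
    "TF16": (393, 1192),
}

def implied_power_watts(tflops, gflops_per_watt):
    """Power implied by the two published figures (1 TFLOPS = 1,000 GFLOPS)."""
    return tflops * 1000 / gflops_per_watt

implied = {fmt: implied_power_watts(*row) for fmt, row in MN_CORE_2.items()}

for fmt, watts in implied.items():
    print(f"{fmt}: about {watts:.0f} W")
```

All four rows land in roughly the same range (about 320 to 330 W), which suggests the efficiency column was computed against a single power figure.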

Products equipped with MN-Core 2

MN-Server 2

5U rack-mount server equipped with eight MN-Core 2 boards

Model number: MNS2V1
AI accelerator: MN-Core 2 (8 boards)
Theoretical performance: 3.1 PFLOPS (TF16)
Standard price: 20 million yen (excluding tax)
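The server's theoretical performance follows from the per-board figure. The short check below assumes each of the eight boards contributes the full single-chip TF16 figure of 393 TFLOPS; that one-chip-per-board reading is an assumption made here, not something the specification states explicitly.

```python
BOARDS_PER_SERVER = 8        # from the MN-Server 2 specification
TF16_TFLOPS_PER_BOARD = 393  # from the MN-Core 2 table; assumed to apply per board

server_tflops = BOARDS_PER_SERVER * TF16_TFLOPS_PER_BOARD
server_pflops = server_tflops / 1000

print(f"{server_tflops} TFLOPS = {server_pflops:.1f} PFLOPS")  # 3144 TFLOPS = 3.1 PFLOPS
```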

MN-Core 2 Devkit

A desktop machine equipped with a single MN-Core 2 board. Compact and easy to install in an office environment, it offers a simple way to experience AI acceleration powered by MN-Core 2.

AI accelerator: MN-Core 2 (1 board)
Theoretical performance: 393 TFLOPS (TF16)

Product	Description	Model number	Standard price
MN-Core 2 Devkit	MN-Core 2 Devkit (hardware unit only)	MNC2DV1	2 million yen (excluding tax)
MN-Core 2 Devkit Basic package	Package including an MN-Core 2 Devkit, delivery, installation and initial setup	MNC2DV1bk	2.5 million yen (excluding tax)

*The MN-Core 2 Devkit Basic package is available in Japan only.

AI training and high-performance computing processor
MN-Core (first generation)

The first-generation MN-Core consists of four dies integrated into one package. Each die has 512 MABs, giving the processor a total of 2,048 MABs. Manufactured on the TSMC 12 nm process, the first-generation MN-Core has higher peak performance and energy efficiency than other accelerators built on the same process.

In 2020, PFN began operating MN-3, a supercomputer powered by 160 MN-Core processors connected with a specialized interconnect. MN-3 has topped the Green500 list of the world’s most energy-efficient supercomputers multiple times.

Estimated power consumption (W)	500
Peak performance (TFLOPS)	32.8 (DP) / 131 (SP) / 524 (HP)
Estimated performance per watt (TFLOPS / W)	0.066 (DP) / 0.26 (SP) / 1.0 (HP)
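The figures above are consistent with one another, as the following sketch shows by recomputing the MAB count and the performance per watt from the stated die count, peak performance and estimated power. The variable names are chosen here for illustration.

```python
DIES_PER_PACKAGE = 4   # from the text
MABS_PER_DIE = 512     # from the text
POWER_W = 500          # estimated power consumption from the table
PEAK_TFLOPS = {"DP": 32.8, "SP": 131, "HP": 524}

total_mabs = DIES_PER_PACKAGE * MABS_PER_DIE
per_watt = {precision: tflops / POWER_W for precision, tflops in PEAK_TFLOPS.items()}

print(f"Total MABs: {total_mabs}")
for precision, value in per_watt.items():
    print(f"{precision}: {value:.3f} TFLOPS/W")
```

The recomputed values round to the table's 0.066, 0.26 and 1.0 TFLOPS/W. The peak figures also show that each step down in precision (DP to SP, SP to HP) roughly quadruples throughput.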

MN-Core Series
Use Cases

The MN-Core series processors have delivered significantly higher performance than GPUs on real AI workloads, thanks to their high energy efficiency and high peak performance.

AI-based 3D reconstruction

The first-generation MN-Core delivered a tenfold speedup in neural network-based reconstruction of thousands of 3D models from 2D images for PFN 3D Scan.

Automatic optimization of image recognition model

Automatic optimization of the image recognition model for Kachaka, an autonomous mobile robot currently sold in Japan, ran seven times faster on MN-Core™ than on a GPU. (1st-generation MN-Core)

AI-accelerated materials discovery

Neural network-based atomistic simulation of new materials on Matlantis™ ran more than five times faster on MN-Core than on a GPU. (1st-generation MN-Core)

Through PFN’s cloud service PFCP™, the computing power of the second-generation MN-Core 2 was used experimentally in Matlantis, confirming faster performance than GPUs in atomic-level simulations of systems with a small number of atoms.

Low-atom-number simulation using PFCP

Comparison with GPU (vertical axis: simulation time, horizontal axis: number of atoms)

MN-Core Series
Roadmap

PFN foresaw the rapid rise in demand for AI chips and launched development of the first-generation MN-Core™ processor in 2016. We continue to advance development and commercialization efforts today.

MN-Core Series: The Architecture

By shifting control and other hardware functions to software, the MN-Core series maximizes the proportion of arithmetic units on the hardware, achieving exceptional performance and power efficiency.

The MN-Core hardware is densely packed with matrix arithmetic units (MAUs). The design is entirely SIMD (single instruction, multiple data) with no conditional branching, and this simplicity maximizes the share of semiconductor area devoted to arithmetic units. The basic building block is the MAB (matrix arithmetic block), which consists of four PEs (processor elements) and one MAU, and the MABs are organized in a hierarchical structure. Each level of the hierarchy supports multiple modes such as scatter, gather, broadcast and reduce, which allows flexible programming.
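To make the four modes concrete, here is a toy model of one level of such a hierarchy, with a parent distributing data to four children and collecting results. It is a minimal sketch for intuition only: the function names and data layout are hypothetical and do not represent the MN-Core instruction set. The masked select at the end shows one common way SIMD designs express a choice without a conditional branch; whether MN-Core uses this exact mechanism is not stated in the text.

```python
# Toy model only: names and data layout are hypothetical, not the MN-Core ISA.
PES_PER_MAB = 4  # from the text: each MAB has four PEs

def broadcast(value, n=PES_PER_MAB):
    """Give every child the same value."""
    return [value] * n

def scatter(values, n=PES_PER_MAB):
    """Split a flat list into n equal contiguous chunks, one per child."""
    size = len(values) // n
    return [values[i * size:(i + 1) * size] for i in range(n)]

def gather(chunks):
    """Concatenate the children's chunks back into one flat list."""
    return [x for chunk in chunks for x in chunk]

def reduce_sum(chunks):
    """Combine the children's chunks element-wise into one chunk."""
    return [sum(column) for column in zip(*chunks)]

def simd_select(mask, a, b):
    """Branch-free choice: every lane computes both sides, the mask picks one."""
    return [m * x + (1 - m) * y for m, x, y in zip(mask, a, b)]

data = list(range(8))       # [0, 1, 2, 3, 4, 5, 6, 7]
per_pe = scatter(data)      # [[0, 1], [2, 3], [4, 5], [6, 7]]
scale = broadcast(10)       # [10, 10, 10, 10]

# Every child applies the same instruction to its own chunk (SIMD).
scaled = [[s * x for x in chunk] for s, chunk in zip(scale, per_pe)]

print(gather(scaled))       # [0, 10, 20, 30, 40, 50, 60, 70]
print(reduce_sum(scaled))   # [120, 160]
print(simd_select([1, 0, 1, 0], [1, 2, 3, 4], [9, 9, 9, 9]))  # [1, 9, 3, 9]
```

Because the same four operations can be applied at each level of a hierarchy, a program can compose them to move data between levels without any per-element control flow.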

PFN also develops a dedicated compiler for the MN-Core series so that users can harness the hardware's full potential without making major changes to existing AI workloads. Starting from computational graphs defined in high-level frameworks such as PyTorch and JAX, the MN-Core compiler generates optimized instructions and data movement. Because this work spans several levels, from computational graph-level control down to low-level instruction generation, the compiler divides the problem by level of abstraction and is built from components that can each be improved at their own level.
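The idea of splitting compilation by level of abstraction can be sketched with a toy two-stage pipeline: a graph-level pass that orders operations, followed by a per-operation pass that emits low-level instructions. Everything below is hypothetical, including the graph format, the pass names and the instruction strings; it illustrates the layering only and does not reflect the MN-Core compiler's actual interfaces.

```python
# Hypothetical sketch: graph format, pass names and instruction strings are
# invented for illustration and are not the MN-Core compiler's interfaces.

# A computational graph as (output, operation, inputs) tuples.
GRAPH = [
    ("h", "matmul", ("x", "w")),
    ("y", "relu", ("h",)),
]

def schedule(graph):
    """Graph level: order nodes so each runs after the values it consumes."""
    produced = {out for out, _, _ in graph}
    # Names that are consumed but never produced are the graph's inputs.
    ready = {name for _, _, ins in graph for name in ins} - produced
    ordered, pending = [], list(graph)
    while pending:
        runnable = [node for node in pending if set(node[2]) <= ready]
        if not runnable:
            raise ValueError("graph has a cycle")
        for node in runnable:
            ordered.append(node)
            ready.add(node[0])
            pending.remove(node)
    return ordered

# Instruction level: one template list per operation (made-up mnemonics).
LOWERING = {
    "matmul": ["load {a}", "load {b}", "mau_matmul", "store {out}"],
    "relu": ["load {a}", "pe_max_zero", "store {out}"],
}

def lower(node):
    """Expand a single graph node into its low-level instruction sequence."""
    out, op, ins = node
    names = dict(zip("ab", ins), out=out)
    return [template.format(**names) for template in LOWERING[op]]

def compile_graph(graph):
    """Run the graph-level pass, then the instruction-level pass."""
    return [instr for node in schedule(graph) for instr in lower(node)]

program = compile_graph(list(reversed(GRAPH)))  # input order does not matter
print("\n".join(program))
```

Keeping the two passes separate means the scheduling logic can change without touching the lowering table, and vice versa, which is the kind of independent improvement the layered design aims for.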

Join the PFN team

Careers

Contact us for inquiries on
our products, solutions and R&D

Contacts
