MT9068

Computing Power: Delivers 512 TOPS of INT8 compute performance, optimized for deep learning training and inference in high-demand cloud and data center environments.

Detailed information

  • Computing Power: Delivers 512 TOPS of INT8 compute performance, optimized for deep learning training and inference in high-demand cloud and data center environments.

  • Architecture: Features an innovative multi-engine design with enhanced dataflow, execution models, and memory management, achieving higher compute density and efficiency while reducing external bandwidth requirements compared to traditional GPUs.

  • Interface: Full-height, full-length PCIe Gen3 standard form factor ensures seamless integration with servers and enterprise infrastructure, maximizing data transfer stability and throughput.

  • Software Support: Provides CUDA/OpenCL hardware acceleration with full compatibility for existing software ecosystems, minimizing development friction and enabling seamless integration into legacy systems.

  • Memory: Utilizes advanced manufacturing processes to support up to 128GB high-bandwidth memory, excelling in memory-intensive workloads such as large model inference and real-time analytics.

About Us

We are a leading provider of artificial intelligence (AI) chip acceleration solutions spanning cloud and edge computing environments. Through innovative computing architecture design, we deliver deep support for the development and deployment of end-to-end sparsified neural networks, building universal AI computing platforms that combine ultra-high computational throughput with ultra-low power efficiency.

Contact Us

barrack@mathews-tech.com.cn