Origin E2
Balanced Performance for AI Inference
On-device AI is a must-have for many new designs. Silicon architects look for solutions that support the latest AI technologies, such as transformers and Stable Diffusion, while balancing performance and low power consumption with minimal latency.
Ideal for Edge AI
The Origin™ E2 is a family of power- and area-optimized NPU IP cores designed for devices like smartphones and edge nodes. It supports video (at resolutions up to 4K and beyond), audio, and text-based neural networks, including public, custom, and proprietary networks.
Innovative Architecture
The Origin E2 neural engine uses Expedera’s unique packet-based architecture, which is far more efficient than common layer-based architectures. The architecture enables parallel execution across multiple layers, achieving better resource utilization and deterministic performance. It also eliminates the need for hardware-specific optimizations, allowing customers to run their trained neural networks unchanged, without reducing model accuracy. This innovative approach greatly increases performance while lowering power, area, and latency.
Choose the Features You Need
Customization brings many advantages, including increased performance, lower latency, reduced power consumption, and the elimination of dark silicon waste. Expedera works with customers during the design stage to understand their use case(s), PPA goals, and deployment needs. Using this information, we configure Origin IP to create a customized solution that precisely fits the application.
Market-Leading 18 TOPS/W
Sustained power efficiency is key to successful AI deployments. Continually cited as one of the most power-efficient architectures in the market, Origin NPU IP achieves a market-leading, sustained 18 TOPS/W.
Efficient Resource Utilization
Origin IP scales from GOPS to 128 TOPS in a single core. The architecture eliminates the memory sharing, security, and area penalty issues faced by lower-performing, tiled AI accelerator engines. Origin NPUs achieve sustained utilization averaging 80%—compared to the 20-40% industry norm—avoiding dark silicon waste.
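To make the utilization claim concrete, a short sketch of the arithmetic (the 16 TOPS peak figure below is an illustrative assumption, not an Expedera spec; the utilization values come from the text above):

```python
def effective_tops(peak_tops: float, utilization: float) -> float:
    """Effective throughput is peak compute capacity scaled by sustained utilization."""
    return peak_tops * utilization

# Hypothetical 16 TOPS peak engine (illustrative value only):
sustained_80 = effective_tops(16.0, 0.80)   # 80% sustained utilization (Origin claim)
sustained_30 = effective_tops(16.0, 0.30)   # midpoint of the 20-40% industry norm

# At the same peak TOPS, an 80%-utilized engine does >2.5x the useful work.
assert sustained_80 / sustained_30 > 2.5
```

The same ratio holds at any peak capacity, which is why sustained utilization, rather than headline peak TOPS, is the better predictor of delivered performance.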
Full TVM-Based Software Stack
Origin uses a full TVM-based software stack. TVM is widely trusted and used by OEMs worldwide. The easy-to-use software allows importing trained networks and provides various quantization options along with automatic completion, compilation, estimation, and profiling tools. It also supports multi-job APIs.
Successfully Deployed in 10M Devices
Quality is key to any successful product. Origin IP has been successfully deployed in over 10 million consumer devices, with designs in multiple leading-edge nodes.
Use Case
A Better Smartphone User Experience
One of the world's leading smartphone manufacturers wanted to deploy a 4K video low-light denoising AI algorithm on its next-generation platform. Its current-generation NPU could process only a few frames per second (FPS) and wasn’t up to the task. The manufacturer selected Expedera’s Origin NPU IP because it exceeded all expectations and outperformed every other NPU they evaluated. It increased FPS by 20X while using less than half the power of the former NPU, improving PPA by 40X and enabling the manufacturer to deliver a competitively differentiated smartphone. Origin’s impressive performance gains and power efficiencies resulted from its efficient architecture and use-case customizations. The manufacturer now includes Origin IP in a series of successful products.
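The 40X figure follows directly from the two gains stated in the case study; a quick check of the arithmetic (my calculation, not Expedera's published methodology):

```python
fps_gain = 20.0    # 20X more frames per second (from the case study)
power_ratio = 0.5  # less than half the power of the former NPU

# Performance-per-watt improvement: throughput gain divided by relative power draw.
ppa_improvement = fps_gain / power_ratio
assert ppa_improvement == 40.0  # matches the 40X PPA figure cited above
```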
Compute Capacity | 0.5K to 10K INT8 MACs |
Multi-tasking | Run Multiple Simultaneous Jobs |
Power Efficiency | 18 TOPS/W effective; no pruning, sparsity or compression required (though supported) |
Example Networks Supported | ResNet, MobileNet, MobileNet SSD, Inception V3, RNN-T, BERT, EfficientNet, FSRCNN, CPN, CenterNet, U-Net, YOLO V3, YOLO V5, ShuffleNet V2, others |
Example Performance | MobileNet V1 (224 x 224): 8750 IPS, 13,482 IPS/W (N7 process, 1 GHz, no sparsity/pruning/compression applied) |
Layer Support | Standard NN functions, including Conv, Deconv, FC, Activations, Reshape, Concat, Elementwise, Pooling, Softmax, and others. Programmable general FP functions, including Sigmoid, Tanh, Sine, Cosine, Exp, and others; custom operators supported. |
Data Types | INT4/INT8/INT10/INT12/INT16 activations/weights; FP16/BFloat16 activations/weights |
Quantization | Channel-wise quantization (TFLite specification); software toolchain supports Expedera, customer-supplied, or third-party quantization |
Latency | Deterministic performance guarantees, no back pressure |
Frameworks | TensorFlow, TFLite, ONNX, others supported |
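For context, the MobileNet V1 figures in the table imply a sub-watt power draw. A back-of-the-envelope check (my arithmetic, assuming IPS/W is simply IPS divided by watts):

```python
ips = 8750           # inferences per second (from the table)
ips_per_watt = 13482 # power efficiency (from the table)

# Implied power draw for this workload: throughput / efficiency.
implied_watts = ips / ips_per_watt
assert 0.6 < implied_watts < 0.7  # roughly 0.65 W at 1 GHz on N7
```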