5. Deployment
A placeholder.