> this is a “holy shit” moment for Rust in AI applications Yeah because I realiz... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

est on Nov 13, 2023 | parent | context | favorite | on: Fast and Portable Llama2 Inference on the Heteroge...

> this is a “holy shit” moment for Rust in AI applications

Yeah because I realized the 2MB is just a wrapper that reads stdin and offloads everything to wasi-nn API.

> The core Rust source code is very simple. It is only 40 lines of code. The Rust program manages the user input, tracks the conversation history, transforms the text into the llama2’s chat template, and runs the inference operations using the WASI NN API.

You can do the same using Python with fewer lines of code and maybe smaller executable size.

gumby on Nov 13, 2023 [–]

Pretty damning if 40 lines of rust to read stdin generates a 2 MB binary!

lakpan on Nov 13, 2023 | [–]

Presumably that also accounts for the WASM itself

gpderetta on Nov 13, 2023 | | [–]

Indeed. I hope it does include the WASM VM.

Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4
Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact