Question 1

What is the cheapest Mac to run Llama 3.1 8B?

Accepted Answer

A used Mac mini M1 8c/8c 16GB (2020) at roughly $436 runs Llama 3.1 8B at Q8_0 quantization, generating about 3.8 tok/s.

Question 2

How much RAM does Llama 3.1 8B need?

Accepted Answer

About 5GB resident at Q4_K_M quantization (weights plus overhead), so a machine needs roughly 7GB of unified memory to run it comfortably.

Question 3

How good is Llama 3.1 8B?

Accepted Answer

It scores 17 on the general intelligence index (Artificial Analysis scale) and 14 on the coding scale (SWE-bench), interpolated from published benchmark relationships.

Machine	Used price	Runs at	Est. speed	Max context
Mac mini M1 8c/8c 16GB (2020)	$436 EST.	Q8_0	3.8 tok/s	27K tokens
Mac mini M4 10c/10c 16GB (2024)	$461 EST.	Q8_0	6.8 tok/s	27K tokens
Mac mini M2 8c/10c 16GB (2023)	$499 EST.	Q8_0	5.7 tok/s	27K tokens
Mac mini M4 10c/10c 24GB (2024)	$615 EST.	FP16	3.6 tok/s	15K tokens
Mac mini M2 8c/10c 24GB (2023)	$624 EST.	FP16	3.0 tok/s	15K tokens
Mac mini M4 10c/10c 32GB (2024)	$769 EST.	FP16	3.6 tok/s	61K tokens
Mac mini M2 Pro 10c/16c 16GB (2023)	$812 EST.	Q8_0	11 tok/s	27K tokens
Mac mini M2 Pro 12c/19c 16GB (2023)	$999 EST.	Q8_0	11 tok/s	27K tokens
Mac mini M2 Pro 10c/16c 32GB (2023)	$1,062 EST.	FP16	6.0 tok/s	61K tokens
Mac mini M4 Pro 12c/16c 24GB (2024)	$1,077 EST.	FP16	8.2 tok/s	15K tokens

Cheapest Mac to run Llama 3.1 8B

Every Mac that runs it, by used price

Run it

Or skip the hardware