73 points by davidklemke 4 days ago | 23 comments
dvt about 4 hours ago |
geerlingguy about 6 hours ago |
- Commodity Arm SoC (or sometimes N100 or N150 x86)
- 8/16/32 GB of LPDDR5x RAM
- 'NPU' (usually unspecified) with ambiguous 'TOPS' number (like 20, 40, 80)
Usually specifics aren't provided, and TOPS is never defined in a technically useful way. The few times it is, it's from more established companies (e.g. Asus or Raspberry Pi integrating a well-known NPU chip into one of their products).
It's worse at this point than the peak of the crypto boom, when I was getting emails touting the next chain-of-proof software, or ledger-this/ledger-that. Now that there are a few actual use cases for this hardware, it requires more nuance to separate the wheat from the chaff.
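To illustrate why a bare 'TOPS' number says so little: a rough sketch of how a theoretical peak figure is usually derived, and how the same silicon can be quoted at very different values. The MAC count, clock, and the 2x multipliers for precision and sparsity are made-up assumptions for illustration, not specs of any real NPU.

```python
def peak_tops(mac_units: int, clock_hz: float, ops_per_mac: int = 2) -> float:
    """Theoretical peak in tera-operations per second.

    One multiply-accumulate (MAC) is conventionally counted as 2 operations.
    """
    return mac_units * ops_per_mac * clock_hz / 1e12


def headline_tops(base_tops: float, precision_factor: float = 1.0,
                  sparsity_factor: float = 1.0) -> float:
    """Scale a base figure by assumed marketing multipliers."""
    return base_tops * precision_factor * sparsity_factor


# Hypothetical NPU: 4096 INT8 MAC units at 1 GHz (made-up numbers).
int8_dense = peak_tops(mac_units=4096, clock_hz=1.0e9)

# Same silicon, quoted with an assumed 2x for INT4 and 2x for sparsity.
int4_sparse = headline_tops(int8_dense, precision_factor=2.0, sparsity_factor=2.0)

print(f"INT8 dense:  {int8_dense:.1f} TOPS")
print(f"INT4 sparse: {int4_sparse:.1f} TOPS")
```

Under these assumptions the same chip is "about 8 TOPS" or "about 33 TOPS" depending on which precision and sparsity mode the number refers to, which is why a figure without those details can't be compared across products.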
And for me, I spend weeks, typically, with any hardware I _do_ review, running as many models and test runs as I can (and documenting everything on GitHub, in depth, with scripts so other people can verify). Most reviewers (like those with publications named in this post) either don't have the time, or sadly, the understanding, to test these devices in a meaningful way.
Therefore, random blog posts (which are getting harder and harder to find, amidst the AI-laden first 2-4 pages of DuckDuckGo and Google results) are the best source of information. Or sometimes a post on Mastodon, which is never easy to find since search isn't a thing there.
Edit: Ah, they did reach out around CES time. Funny seeing their pitch deck including a note on Dr. Miles Mi, with a row of logos on that page including Apple, MIT, Berkeley, DJI, VIVO, Tuya, and a few others, as if they were using this project or something?
shrikaranhanda about 2 hours ago |
fwipsy about 7 hours ago |
There you go, two sentences without burying the lede.
Is it maybe competitive value anyways though? Even if you only think of the accelerators, 48 GB + 160 TOPS seems comparable to some Strix Halo mini PCs with 64 GB - lower memory bandwidth but a few hundred dollars cheaper. If they sold just the accelerator card for $800 or something that would be potentially very interesting.
lurkshark about 3 hours ago |
https://www.kickstarter.com/projects/tiinyai/tiiny-ai-pocket...
gnabgib about 7 hours ago |
Including questions of LLM origin. Seems like the OP might have submitted that one (47431685) although there's another copy now (beyond this SCP entry from 3 days ago)
smartbit 3 days ago |
Will be interesting to see if a public outcry will happen once these boxes start arriving at those who funded the kickstarter.
VladVladikoff about 5 hours ago |
Flagged.
neuroelectron about 6 hours ago |
DeathArrow about 1 hour ago |
>For perspective: a consumer NVIDIA RTX 4060 Ti (~$400) can run comparable 3B active-parameter MoE workloads at 70–90 tok/s with 100K+ context, depending on setup. The Pocket Lab lands around 6–12 tok/s at 8K–32K context.
>Same class of workload. Roughly 5–10× slower, at 3× the price, with tighter constraints.
Which is fine, but please disclose it. Otherwise, like in this case, I'm going to assume the author is a moron that can't write for shit who thinks their readers are morons that can't read for shit.
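For anyone wanting to sanity-check the quoted "5–10× slower" figure, here is a quick sketch using only the tok/s ranges in the quote. It assumes any point in one range can be paired with any point in the other, which may not match how the article's author compared setups.

```python
def ratio_range(fast: tuple[float, float],
                slow: tuple[float, float]) -> tuple[float, float]:
    """Smallest and largest fast/slow ratio over two (low, high) ranges."""
    fast_lo, fast_hi = fast
    slow_lo, slow_hi = slow
    # Smallest ratio: slowest GPU run against the fastest Pocket Lab run.
    # Largest ratio: fastest GPU run against the slowest Pocket Lab run.
    return fast_lo / slow_hi, fast_hi / slow_lo


gpu_tok_s = (70.0, 90.0)     # RTX 4060 Ti range from the quote
pocket_tok_s = (6.0, 12.0)   # Pocket Lab range from the quote

lo, hi = ratio_range(gpu_tok_s, pocket_tok_s)
print(f"GPU is {lo:.1f}x to {hi:.1f}x faster")
```

Taken at the endpoints, the quoted ranges give a gap of roughly 5.8× to 15×, so "5–10×" is in the right ballpark at the low end but understates the worst case.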