
objektif · 2026-05-09T13:09:41 1778332181

This is pretty insightful, thank you. Which provider are you guys using? Is it also over the phone, or fully web/app based? Do you have any resources you can point me to so I can learn about this?

aenis · 2026-05-09T14:22:38 1778336558

We use a bunch. At the moment we mainly self-host (using Pipecat), and we also use Daily and a few niche boutique suppliers who built things for us.

There is a great resource for learning this stuff: the CEO of Daily, Kwindla Kramer, hosted a series of 1-hour sessions on low-latency voice AI. Here:

https://youtube.com/playlist?list=PLzU2zoMTQIHjMPZ-OnpC3ozZs...

Some of this is a bit outdated but most of it is very valuable.

Kwindla posts a lot of extremely useful stuff on X and LinkedIn, including working, easily replicable sub-500ms setups.

objektif · 2026-05-09T14:35:54 1778337354

Beautiful, thanks. We are also looking at this, and another complication is that transcripts can get pretty messy: updates, corrections, etc.
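One common way to tame those updates, as a minimal sketch: keep only the latest text per segment and lock a segment once it is marked final. This assumes the speech-to-text provider tags each update with a segment id and a final flag; `TranscriptBuffer`, `segment_id`, and `is_final` are hypothetical names for illustration, not any specific provider's API.

```python
from dataclasses import dataclass, field


@dataclass
class TranscriptBuffer:
    """Keeps the latest text per segment; finalized segments are never overwritten."""
    _segments: dict[int, str] = field(default_factory=dict)
    _final: set[int] = field(default_factory=set)

    def apply(self, segment_id: int, text: str, is_final: bool) -> None:
        # Hypothetical update shape: (segment_id, text, is_final).
        if segment_id in self._final:
            return  # ignore late interim updates for a finalized segment
        self._segments[segment_id] = text
        if is_final:
            self._final.add(segment_id)

    def text(self) -> str:
        # Join segments in id order to rebuild the running transcript.
        return " ".join(self._segments[i] for i in sorted(self._segments))


buf = TranscriptBuffer()
buf.apply(0, "I want to book", is_final=False)
buf.apply(0, "I want to book a flight", is_final=True)   # correction replaces interim
buf.apply(1, "to Boston", is_final=False)
buf.apply(0, "I want to look", is_final=False)           # stale interim, ignored
print(buf.text())
```

Whether to act on interim text or wait for the final version is a latency vs. accuracy trade-off that depends on the application.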

objektif · 2026-04-23T18:10:49 1776967849

Are there faster mini/nano versions as well?

tedsanders · 2026-04-23T18:15:07 1776968107

Not this time, no.

abi · 2026-04-23T18:20:38 1776968438

Usually, those get released a few weeks later.

objektif · 2026-04-22T19:21:18 1776885678

Does anyone know a good provider for a low-latency LLM API? We tried looking at Cerebras and Groq, but they have zero capacity right now. GPT models are too slow for us at the moment. Gemini models are better, but not really at the same level as GPT.

spmurrayzzz · 2026-04-23T02:15:55 1776910555

This depends a bit on your cost sensitivity and which model families you want support for, but Baseten and Fireworks have been my go-to.

Currently Baseten has ~610ms TTFT and ~82 tk/s for Kimi K2.6, which is roughly 2x the throughput of GPT-5.4 (per their OpenRouter stats). GLM 5 is slightly slower on both metrics, but still strong.
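As a rough back-of-the-envelope sketch, the two metrics combine into an estimated time for a full reply: TTFT plus output tokens divided by throughput. The function name and the 150-token reply length are assumptions for illustration; real throughput varies with load, prompt length, and batching.

```python
def estimated_response_seconds(ttft_ms: float, tokens_per_s: float, output_tokens: int) -> float:
    """Estimate wall-clock time to stream a full reply.

    Simplified model: time to first token, then a constant decode rate.
    """
    if tokens_per_s <= 0:
        raise ValueError("tokens_per_s must be positive")
    return ttft_ms / 1000.0 + output_tokens / tokens_per_s


# TTFT and throughput are the figures quoted above for Kimi K2.6 on Baseten;
# 150 output tokens is a hypothetical reply length.
print(f"{estimated_response_seconds(610, 82, 150):.2f} s")
```

For a voice use case the TTFT term usually matters most, since speech synthesis can start as soon as the first tokens arrive.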

objektif · 2026-04-08T01:17:52 1775611072

No. They like stealing land.

objektif · 2026-04-01T02:45:57 1775011557

What are you basing how good they are on? Personal experience or some benchmarks?

a-t-c-g · 2026-04-01T03:55:36 1775015736

Benchmarks. We have internal ones testing reasoning fine-tuned models vs. frontier models + prompts.

For some use cases it ranges from parity performance at 1/20th the cost up to exceeding frontier performance at 1/10th the cost. The trade-off is, of course, narrow applicability.

objektif · 2026-04-01T12:51:15 1775047875

How can I learn more about these models? Are they open source?

a-t-c-g · 2026-04-01T17:25:59 1775064359

There are plenty of OSS fine-tuned models and base models around. If you're looking to do this on your own dataset, it's worth getting in touch with cartesien.io or wiring up https://github.com/SalesforceAIResearch/PretrainRL-pipeline

objektif · 2026-04-01T19:13:18 1775070798

Thank you.

objektif · 2026-02-28T13:47:54 1772286474

Yeah, we care about Iranian protesters, you got this right.

mupuff1234 · 2026-02-28T14:08:45 1772287725

That's not what I said.

objektif · 2026-02-27T01:42:05 1772156525

Will Randian tech bros start calling for socialism soon? Inshallah.

objektif · 2026-02-22T03:17:12 1771730232

He sounds greedy as fuck. He speedran a buggy POS to sell to a model company? Obvious as day, what is there to see?

objektif · 2026-02-22T03:12:48 1771729968

PG commissioned dan on X to send anyone who criticizes Andrej or Pete to the gulag.

objektif · 2026-02-21T13:47:57 1771681677

Anyone using claws for something meaningful in a startup environment? I want to try, but I'm not sure what we can do with this.

alansaber · 2026-02-21T14:16:05 1771683365

PR. Say you fired all your friends and replaced them with mac minis.

objektif · 2026-02-21T16:06:56 1771690016

Haha, good point! Once I do, how much money can I raise on my Series Z?