Hacker Newsnew | past | comments | ask | show | jobs | submit | mritchie712's commentslogin

tldr: this caches your S3 data in EFS.

we run datalakes using DuckLake and this sounds really useful. GCP should follow suit quickly.


I was thinking of using it with Duckdb as well but seems it would be of limited benefit. Parquet objects are in MBs, so they would be streamed directly from S3. With raw parquet objects, it might help with S3 listing if you have a lot of them (shave off a couple of seconds from the query). If you are already on Ducklake, Duckdb will use that for getting the list of relevant objects anyway.

Maybe the OP is thinking of reading/writing to DuckDB native format files. Those require filesystem semantics for writing. Unfortunately, even NFS or SMB are not sufficiently FS-like for DuckDB.

Parquet is static append only, so DuckDB has no problems with those living on S3.


I am curious about this use case

How do you see it helping with DuckLake?


I think this is a move to get business users paying consumption (per token) pricing for codex instead of a flat rate.

How would that work, when it's the flat rate subscription they've reduced the price of?

Tighter limits under the auspices of "lower price!"

Yeah, the lede is buried a bit, these new rate-cards seem to be aligning towards token-based pricing with the prior rates now labeled 'legacy'

https://help.openai.com/en/articles/20001106-codex-rate-card https://help.openai.com/en/articles/11481834-chatgpt-rate-ca...


The prior rates in question there are message based pricing, not the flat rate subscription.

Hannes (one of the creators) had a pet duck


what's the use case for cortex? is anyone here using it?

We run a lakehouse product (https://www.definite.app/) and I still don't get who the user is for cortex. Our users are either:

non-technical: wants to use the agent we have built into our web app

technical: wants to use their own agent (e.g. claude, cursor) and connect via MCP / API.

why does snowflake need it's own agentic CLI?


When you say just Cortex it is ambiguous as there is Cortex Search, Agents, Analyst, and Code.

Cortex Code is available via web and cli. The web version is good. I've used the cli and it is fine too, though I prefer the visuals of the web version when looking at data outputs. For writing code it is similar to a Codex or Claude Code. It is data focussed I gather more so than other options and has great hooks into your snowflake tables. You could do similar actions with Snowpark and say Claude Code. I find Snowflake focus on personas are more functional than pure technical so the Cortex Code fits well with it. Though if you want to do your own thing you can use your own IDE and code agent and there you are back to having an option with the Codex Code CLI along with Codex, Cursor or Claude Code.


Because "stock price go up"?


claude code solved this about a month ago


I think you're reading it exactly right


It's half the price per token. Not all tokens are generated equally.


Neither are cars but Ill take a Porsche over a Ferrari for a fraction of the price.


What about a Porsche vs. a Toyota Camry for half the price?


which model?


For me any, tbh. I wouldn't fit in a Ferrari lol


We've (https://www.definite.app/) replaced quite a few metabase accounts now and we have a built-in lakehouse using duckdb + ducklake, so I feel comfortable calling us a "duckdb-based metabase alternative".

When I see the title here, I think "BI with an embedded database", which is what we're building at Definite. A lot of people want dashboards / AI analysis without buying Snowflake, Fivetran, BI and stitching them all together.


Not open source though?


us-east-2 has been down for over 2 hours now


And still down. In my case, no auth and no reads.

If this had happened prior to 4PM Eastern, I would have been screwed on my main early-stage project. I guess it's time to move up the timeline on real backend with failover.


> when you connect a warehouse like Snowflake, BigQuery, or Postgres

I'm curious what others are seeing connecting AI tools to Snowflake. Snowflake charges $3 per compute hour and it's pretty easy for an agent to run dozens of queries asynchronously.

As others have mentioned, if you want a notebook, compare this hard against Hex. It's unclear what LiveDocs would give you over Hex (cheaper maybe?).

ps - if you don't have Snowflake / data warehouse yet, we give you a full data platform (data lake + pipelines + dashboards + agent) at https://www.definite.app/.


Livedocs runs locally on your machine or on customer-managed infra, has full terminal access, supports canvas mode for building custom UIs (not just charts), and uses long-running agent workflows with sub-agents coordinating work over time, etc

There is a lot more to data work than just SQL + charts like the tool you mentioned


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: