It should work with any type of model, though longer chains of thought will be harder for the evaluation model to analyse, since there are more reasoning steps to identify and separate. The quality of the output depends heavily on the model you choose for insights. We tested with Llama3-70B and it worked smoothly most of the time.
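To make the step-splitting concrete, here is a minimal sketch of what the evaluation model has to do per trace; the `split_steps` heuristic and the prompt wording are illustrative assumptions, not the library's actual implementation:

```python
import re

def split_steps(chain_of_thought: str) -> list[str]:
    """Naive heuristic: blank lines or numbered prefixes mark step boundaries.
    Longer chains yield more steps, so the evaluator has more work per trace."""
    parts = re.split(r"\n\s*\n|\n(?=\d+[.)]\s)", chain_of_thought)
    return [p.strip() for p in parts if p.strip()]

def build_eval_prompt(steps: list[str]) -> str:
    """Ask the insight model to judge each extracted step individually."""
    numbered = "\n".join(f"Step {i + 1}: {s}" for i, s in enumerate(steps))
    return (
        "For each step below, say whether it is correct, redundant, or a "
        "reasoning error, with a one-line justification.\n\n" + numbered
    )
```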
We currently give broad suggestions through an insight model that can be chosen during setup. We will keep improving the suggestion prompt/code to make the suggestions more granular in future releases.
Unfortunately, LLMs are a gigantic monster to understand. We were considering the same sliding-window approach you describe, and we will try to keep the library updated with better and more reliable approaches based on new research papers and our internal tests.
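For anyone curious, a sliding-window pass over a long reasoning trace could look roughly like this; the window and stride sizes are placeholders, and the scoring step is left as a comment:

```python
def sliding_windows(tokens: list[str], size: int = 512, stride: int = 256):
    """Yield overlapping windows so every region of a long chain of thought
    is evaluated with some local context on both sides."""
    for start in range(0, max(1, len(tokens) - size + stride), stride):
        yield start, tokens[start:start + size]

# Usage: score each window independently, then aggregate, e.g.
# scores = [evaluate(window) for _, window in sliding_windows(tokens)]
```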
Exactly! Uncertainty is critical for correctly evaluating LLM performance, and we don't need reasoning models spending thousands of tokens on simple questions.
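One cheap uncertainty proxy along these lines (my reading of the comment, not necessarily what the library does) is self-consistency: sample a small model a few times at non-zero temperature and only escalate to an expensive reasoning model when the samples disagree. A sketch:

```python
from collections import Counter

def agreement_uncertainty(answers: list[str]) -> float:
    """0.0 = all samples agree; approaches 1.0 as samples fully disagree.
    `answers` are the normalized final answers from k cheap samples."""
    counts = Counter(a.strip().lower() for a in answers)
    most_common = counts.most_common(1)[0][1]
    return 1.0 - most_common / len(answers)

# Routing idea: cheap model first, reasoning model only when unsure.
# if agreement_uncertainty(samples) > 0.3:
#     answer = reasoning_model(question)  # hypothetical fallback call
```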
Not everyone (and not in every development phase) breaks commits down into something easily describable beyond “update code”. Having mass changes or stream-of-consciousness refactorings in a single commit is absolutely normal.
An author doesn’t need to please a repo’s readers until they see a good reason to do so.
Indeed, that's what most of my projects' commit logs look like in the startup phase. Eventually I make a commit with an "MVP" message and then try to go from there with meaningful messages.
Agree! The 'clean commit' is an ideal, not a reality. Looking back on some of my own repos, I know I should've included a little more reasoning context, if only intermittently.