domi

Opencode llama-server prefill/generation stats plugin

3d 16h ago in localllama@sh.itjust.works from codeberg.org

Thanks! This might come in handy since I have the same issue with the generic "Thinking" in opencode.

It does? Guess I can finally yeet Chromium from my machine then.

DenuvOwO is coming to Linux

9d 22h ago in piracy@lemmy.dbzer0.com from lemmy.ca

According to their now-deleted Reddit account, it will use a custom Proton build on Intel and Zen 4 CPUs. For every other CPU you will need a kernel module.

Email ownership, I give up.

10d 3h ago in selfhosted

If you have trouble with outgoing mail, you can use a hybrid approach.

Receive mail directly on your server but use a mail service to relay your outgoing mail. Configuring that is very simple in mailcow, and there are a few dozen (free) transactional email providers (e.g. Scaleway).

That way you keep receiving your mail privately and only have to give up some privacy when sending.

New Jellyfin Server/Web release: 10.11.11

11d 13h ago in jellyfin@lemmy.ml from forum.jellyfin.org

You should be able to just stop Jellyfin, drag the new version into your Applications folder and start it again.

How did you install Jellyfin?

I also started a Twilight Princess playthrough in dusklight recently. Even though I only wanted to take a quick look around, I'm already in the Arbiter's Grounds right now.

I highly recommend dusklight, it's a very well-made decompilation: https://github.com/TwilitRealm/dusklight

Roku OS’s home screen now features a large, permanent ad

21d 8h ago in technology from arstechnica.com

Plasma Bigscreen is scheduled for its first release next month: https://plasma-bigscreen.org/

Ask again in a different support ticket.

A friend of mine was first told that it's going to arrive soon. He asked again after he saw that everyone else got a free game, and now he got the offer too.

As a side note, Qwen3.6-27B is much more capable than Qwen3.6-35B, even though it is much slower.

https://huggingface.co/unsloth/Qwen3.6-27B-GGUF

For coding tasks where you don't mind waiting, you should be able to barely squeeze in the 8-bit quantized version with 32 GB RAM + 8 GB VRAM and have a pretty competent local model. 4-bit quants work, but they have issues with complex tool calls.
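To sanity-check the "barely squeeze in" part, here is a rough back-of-the-envelope sketch. It assumes the plain llama.cpp `Q8_0` and `Q4_0` block layouts (32 weights plus one 16-bit scale per block) and counts the model as a flat 27 billion parameters. The actual GGUF files in that repo may use other quant types and will differ in size, and the function names are made up for illustration.

```python
# Rough estimate of quantized weight size versus available memory.
# Assumption: plain llama.cpp Q8_0 / Q4_0 blocks, i.e. 32 weights
# plus one 16-bit scale per block. Real GGUF files may mix quant types.
BITS_PER_WEIGHT = {
    "Q8_0": (32 * 8 + 16) / 32,  # 8.5 bits per weight
    "Q4_0": (32 * 4 + 16) / 32,  # 4.5 bits per weight
}


def weight_size_gb(params_billion: float, quant: str) -> float:
    """Estimated size of the quantized weights in GB (10^9 bytes)."""
    total_bits = params_billion * 1e9 * BITS_PER_WEIGHT[quant]
    return total_bits / 8 / 1e9


def headroom_gb(params_billion: float, quant: str,
                ram_gb: float, vram_gb: float) -> float:
    """Memory left over for KV cache, OS and everything else."""
    return ram_gb + vram_gb - weight_size_gb(params_billion, quant)


if __name__ == "__main__":
    for quant in BITS_PER_WEIGHT:
        size = weight_size_gb(27, quant)
        left = headroom_gb(27, quant, ram_gb=32, vram_gb=8)
        print(f"{quant}: ~{size:.1f} GB weights, ~{left:.1f} GB left")
```

Under these assumptions the 8-bit weights alone take most of the combined 40 GB, and whatever is left has to cover the KV cache, the OS and anything else you have running, so long contexts will be tight.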

If you use the MTP branch of llama.cpp (and a suitable model) you can even double or triple your token generation speed: https://github.com/ggml-org/llama.cpp/pull/22673

For easier tasks, disable reasoning for instant responses.

Full-text search proxy for Jellyfin

1y 11mon ago in jellyfin@lemmy.ml from gitlab.com

Plasma 6 - Turning off display after screen is locked

2y 3mon ago in kde@lemmy.kde.social

Wlazny wants to run in the National Council election with the Bierpartei

2y 4mon ago in austria@feddit.de from orf.at

Spare Signals From The Outer Wilds vinyl

2y 10mon ago in outerwilds from files.catbox.moe

Episode discussions for Lemmy

3y 24d ago in anime@lemmy.ml