Posts
7387
Following
805
Followers
546
Owner, operator and janitor of the Froth Zone and its associated services.
TypeScript and Windows apologist. Hated cars before it was cool.
I know that I know nothing.
@fixpoint yes, it only took them 3 weeks
0
0
1

the Idiot formerly known as Sam Therapy

I am now professionally unemployed 🎉
4
1
10
@mischievoustomato @pernia For that, I was using ollama and open-webui.
1
0
2
@mischievoustomato @pernia Well the 3n I just downloaded doesn't support image recognition unless I'm doing something wrong sadretard
1
0
2
@mischievoustomato @pernia Thank you, I will try that one and see if it's less horrid, though horrid is funny
1
0
2
@mischievoustomato @pernia deepseek doesn't support image input so you can't go @gunk is this real
1
0
2
@mischievoustomato @pernia It's the best model that takes images and runs on the 8 GB GPU in my server. I'm trying to find the least bad model to use as a fedi bot because funne
2
0
3

the Idiot formerly known as Sam Therapy

@waifu @zero @phnt @Junes @mischievoustomato @sendpaws the software is doing what it was intended to do, block scrapers. Is it perfect? Hell no, but I can attest to it working on improving the uptime of git.froth.zone
1
0
2

the Idiot formerly known as Sam Therapy

@phnt @Junes @waifu @sendpaws @zero @mischievoustomato Another good point in the default configuration. Someone who was smart would change the bot policy to be more aggressive and PoW everything, which you can do. Someone being DDoSed by bad actors would do that (or just do the HAProxy linked earlier, both valid choices)
https://anubis.techaro.lol/docs/admin/policies
0
0
1
@mischievoustomato @Junes @phnt @waifu @zero If you try hard enough you can bypass nearly anything, but is it worth it for what little you probably get in return? Probably not.
1
0
1

@mischievoustomato @Junes @phnt @waifu @zero they could, yes. They could also make their user agent something absurd, but they don’t. Why? Because it would look suspicious as fuck if they didn’t. Does that matter to some random git repository or open source wiki? Fuck no. Even the local LLM I’ve been messing with says that:

AI scrapers use normal browser user agents to:

  • Evade anti-bot detection.
  • Render dynamic content properly.
  • Ensure compatibility with APIs and servers.
  • Mimic real user behavior and reduce the risk of blocking.

However, this is just one part of a broader strategy (e.g., proxy rotation, request throttling, and behavioral mimicking) to avoid detection.

1
0
1
@sendpaws @Junes @phnt @waifu @zero @mischievoustomato you do realize that what you just sent and anubis are the same exact thing, right? They are both Proof-of-Work proxies. One of which just says 'based' instead of having a stupid anime girl on it.
2
0
4

the Idiot formerly known as Sam Therapy

Hard thinker, this AI model
Thanks Google
2
4
9
@mischievoustomato @Junes @phnt @waifu @zero The problem with that is the point of the crawlers using the regular UA is so they blend in, if they spell something wrong, they will stand out as an obvious bad actor.
That's why User Agent spoofers exist as anti-fingerprinting measures, it's more "private" to look like the most common browser.

the "normal" slurpers and other bad actors don't do that, a malicious one might
3
0
1

the Idiot formerly known as Sam Therapy

@waifu @Junes I do for git.froth.zone, yes. I kept running into downtime from AI scrapers slurping my shitty code and I saw it as an alternative not long after it was first unveiled. I could try disabling it to see if I keep getting slurped by AI trainers.
Malicious scrapers could bypass it by altering their user agent but that would only make them stand out even more and make it easier to just block them (eg. they spell Mozilla as Mozzarella would bypass Anubis by default but be blatantly obvious). Also none of the regular AI slurpers used by eg. OpenAI, Anthropic, Amazon, Alibaba, Google and Microsoft do that, they usually just use a standard browser user agent to hide mask their activity.

I put an annoyance over repeated downtime shrug_yui
1
0
1

the Idiot formerly known as Sam Therapy

@waifu It's me I got banned from telegram like 3 years ago and I wanted an account for some reason
2
0
1

the Idiot formerly known as Sam Therapy

@icst When it comes to super high performance stuff, not at all, afaik. Even when I was in uni pre-chatgpt most ML libraries really only used Python before it was popular and easy to use. Before Python, ML (especially the class I took on AI) was in Lisp.

Btw what I'm using rn is an open source Go program https://github.com/ollama/ollama making my 5070 Ti cry. It runs deepseek entirely on the GPU but the smaller openai model makes my CPU cry too
0
0
1

the Idiot formerly known as Sam Therapy

my CPU is also dying
1
0
1

the Idiot formerly known as Sam Therapy

making my GPU cry by asking about woodchucks
1
0
1

the Idiot formerly known as Sam Therapy

@SuperDicq Good point. Here goes nothing frierenhey
0
0
0
Show older