Posts
8037
Following
823
Followers
581
Owner, operator and janitor of the Froth Zone and its associated services.
TypeScript and Windows 11 apologist. Hated cars before it was cool. Libertarian hater.
I know that I know nothing.
All of my posts are official government opinions and those of your employer.
@mischievoustomato @Junes @phnt @waifu @zero If you try hard enough you can bypass nearly anything, but is it worth it for what little you probably get in return? Probably not.
1
0
1

@mischievoustomato @Junes @phnt @waifu @zero they could, yes. They could also make their user agent something absurd, but they don’t. Why? Because it would look suspicious as fuck if they didn’t. Does that matter to some random git repository or open source wiki? Fuck no. Even the local LLM I’ve been messing with says that:

AI scrapers use normal browser user agents to:

  • Evade anti-bot detection.
  • Render dynamic content properly.
  • Ensure compatibility with APIs and servers.
  • Mimic real user behavior and reduce the risk of blocking.

However, this is just one part of a broader strategy (e.g., proxy rotation, request throttling, and behavioral mimicking) to avoid detection.

1
0
1
@sendpaws @Junes @phnt @waifu @zero @mischievoustomato you do realize that what you just sent and anubis are the same exact thing, right? They are both Proof-of-Work proxies. One of which just says 'based' instead of having a stupid anime girl on it.
2
0
4
Hard thinker, this AI model
Thanks Google
2
4
9
@mischievoustomato @Junes @phnt @waifu @zero The problem with that is the point of the crawlers using the regular UA is so they blend in, if they spell something wrong, they will stand out as an obvious bad actor.
That's why User Agent spoofers exist as anti-fingerprinting measures, it's more "private" to look like the most common browser.

the "normal" slurpers and other bad actors don't do that, a malicious one might
3
0
1
@waifu @Junes I do for git.froth.zone, yes. I kept running into downtime from AI scrapers slurping my shitty code and I saw it as an alternative not long after it was first unveiled. I could try disabling it to see if I keep getting slurped by AI trainers.
Malicious scrapers could bypass it by altering their user agent but that would only make them stand out even more and make it easier to just block them (eg. they spell Mozilla as Mozzarella would bypass Anubis by default but be blatantly obvious). Also none of the regular AI slurpers used by eg. OpenAI, Anthropic, Amazon, Alibaba, Google and Microsoft do that, they usually just use a standard browser user agent to hide mask their activity.

I put an annoyance over repeated downtime shrug_yui
1
0
1
@waifu It's me I got banned from telegram like 3 years ago and I wanted an account for some reason
2
0
1
@icst When it comes to super high performance stuff, not at all, afaik. Even when I was in uni pre-chatgpt most ML libraries really only used Python before it was popular and easy to use. Before Python, ML (especially the class I took on AI) was in Lisp.

Btw what I'm using rn is an open source Go program https://github.com/ollama/ollama making my 5070 Ti cry. It runs deepseek entirely on the GPU but the smaller openai model makes my CPU cry too
0
0
1
my CPU is also dying
1
0
1
making my GPU cry by asking about woodchucks
1
0
1
@SuperDicq You're right, time to speedrun my dox
1
0
0
Running a local LLM exposed to the fediverse would probably be funny but you freaks would annihilate it with horrendous shit
3
0
1
@zonk I just got auto-rejected from a place in an hour, at least that is believable
0
0
1
@zonk wdym, hiring managers exclusively work on rejecting people at 03:00 on a Sunday, that's when everyone does it. At least that's what it seems like, when I get an email at 04:02 for a job I applied to the day before I definitely did not get auto-rejected from.
1
1
2
repeated
@sam i'm still getting rejections from internships i applied to in 2023 lmfao
1
3
11
@zonk They looked at your CV long and hard, please understand.
1
0
1
Rejection from a job I applied in April just landed in my inbox, 10/10 no notes
1
1
4
repeated

Yusarmi Donaltron (Total net worth: $6.14)

GNU/Windows operative system

2
1
3
Show older