Hacker Newsnew | past | comments | ask | show | jobs | submit | UltraSane's commentslogin

Which is silly because you can easily just use OCR and screenshots to create DRM free versions of Kindle books.

Not to mention it’s as easy to download books from Anna’s Archive as it is to buy them from Amazon. It’s weird going through so much effort to lock down books people already paid for.

I wonder how much this is about making it difficult for people to migrate to another platform. I recently switched to Kobo and the reader is far superior to Kindle. I had a hell of a time moving my library though.


I suspect at least some of this comes from publisher pressure. An acquaintance works for one of the big global book publishers and his general sense from upper management is that they still hate having to sell digital books.

It feels like the last major media industry that is holding out against a "future" that has been here for a long time already.


It's all from external pressure. Amazon spending energy on ebook DRM is a negative ROI activity for them.

A vanishingly small % of would-be ebook buyers even know pirated ones exist, and an even smaller one knows how to get those onto their Kindle.

My wife buys dozens of ebooks per year on Amazon, her friends too. I'm guessing if I poll that group, none of them would even know where to start, nor care to.


"Piracy is almost always a service problem" is also true. I see a lot of people who were risen on a pirated .mp3 and .epub to move to the streaming platforms just because it's a bit more convenient.

Yes totally agreed. I pay for streaming music and for Youtube because the costs make sense to me for what I get.

I used to pay for Netflix but now that there's so many different streaming services I have returned to the high seas because we just don't watch enough shows (maybe 3-5 shows a year?), yet they are spread across different services that all cost $20/month now, so the costs don't make sense for us.

For books, honestly, I refuse to accept that an EPUB costs $25 when the hardcover version costs $30. I also have heard first-hand how little of that $25 goes to the author (for the average author, not for a famous one)..

I do try to buy digital books directly from authors when I can, which is increasingly an option from upcoming writers, but otherwise, yarrrr...


This applies to newspapers too — if you compare the print version to the online version of a newspaper you notice that there's a lot more attention paid to the paper version. Whereas the online version has all kinds of aggressive banners and ads.

I think it's a generational thing, for a lot of publishers the internet is this newfangled thing


It gets even weirder in the Netherlands were the book industry has created a cartel. They have a minimum price that you cannot go under.

Of course what happened is that lots of people just started to import English paperbacks bypassing all the local laws. The price difference was just insane.

Dutch people in general do not have an overinflated view of their own language like in France.


It is really easy to buy a book, cut the spine off and feed the pages into a sheet fed scanner.

This reminds me of college, where I used to take my textbooks to the local copy shop to get the pages sliced out and three-hole punched so I only had to carry around currently relevant chapters rather than 30 lbs of books.

As for e-books, long story short, my low-tech chop-and-punch method tended to be cheaper and/or more convenient than the available legal e-book options at the time.

I considered scanning, and even had access to a sheet-fed duplex scanner, but given that the only mobile device I had at the time, a 17" PowerBook G4, was both awkward as an e-book reader and heavier than the unbound printed pages I was carrying around, it wasn't worth the hassle.


I actually bought a special flatbed book scanner where the glass was flush with one side and scanned every page of a book and then returned it. Scanning was tedious but not too bad while watching a good show or movie and getting my money back felt so good. Adobe Acrobat Pro can convert 800MB of scanned pages into a 70MB PDF with searchable and copy-able text.

It's to stop people from seeding new books to shadow libraries. It's not as easy to find new books on AA as on Amazon.

Given how quickly full-quality releases of movies and TV shows appear after they're first streamed, this is surprising to me, at least so long as the PC and/or Android Kindle apps continue to exist.

What OCR do you guys use? I have only seen OCR that makes a lot of errors. Having it be usable requires tons of manual review. I probably wouldn't trust an LLM to do that review because it may introduce its own errors.

Edit: downvoters, would you like to answer my question? I would genuinely like to know. I thought based on the confidence of the comment above there must be a super accurate OCR I've never heard of, but after seeing the sibling comment I'm going to guess there isn't.


Stirling PDF https://github.com/Stirling-Tools/Stirling-PDF is a free self-hosted PDF tool that can do very accurate OCR while keeping the formatting.

Modern OCR is VERY accurate. Heck Adobe Acrobat Pro OCR was essentially perfect 20 years ago.

One of my hobbies is typesetting modern editions of a certain type of rare, obscure old books that were poorly typeset to begin with. Modern OCR—and I’ve tried plenty of tools—is still rather error prone in my application.

Can you name a good open source one? I have spent many hours in the current decade correcting OCR errors. Mostly tesseract.

OCR'd ebooks are universally trash. For one, all formatting is gone. Anything in the book other than ASCII characters will vanish. You lose links within the book and all other advanced features.

And OCR is generally just not accurate enough and still makes very visible mistakes throughout the text.

Have you read many OCR'd ebooks? I have, and every single one was massively inferior. Most I would consider barely readable.


I love it when formatting is removed. Some ebooks especially epub don't work well with alternative fonts somehow.

For books that you want to keep the formatting the best option is to use Adobe Acrobat Pro and its Editable Text and Images feature. This replaces the scanned letters with a custom TrueType font. I used this in college to scan textbooks and it worked really well. Modern OCR on books is incredibly accurate.

see https://www.youtube.com/watch?v=bhJ9zqY8Da0


Open-source, free version of this is Stirling PDF https://github.com/Stirling-Tools/Stirling-PDF where you can do very accurate OCR while keep the formatting.

Writing Opus 4.6 to 4.7 does make more sense for people who read left to right.

I’m impressed with anyone who can read English right to left.


Whoa! TIL! I struggled a bit to read this style at first, but felt it get easier after a few tries.

Right to Left English - read can, who? Anyone with [which] impressed am I.

English can be read in a different order than the normal order when the sentences contain words for which it is easy to guess whether they are agents or patients, e.g. when the agents are animate nouns and the patients are inanimate nouns, or when pronouns are used for the agents or patients.

Otherwise, the non-standard order can be understood incorrectly. While the distinction between agents and patients is the most important that depends on word order in English, there are also other order-dependent distinctions, e.g. between beneficiary and patient, when the beneficiary is not marked by a preposition, or between a noun and its attribute, e.g. "police dog" is not the same as "dog police" and unless there is a detailed context you cannot know what is meant when the word order is wrong.

English is one of the languages with the most rigid word order. There are languages, especially among older languages, where almost any word order can be used without causing ambiguities, because all the possible roles of the words are marked by prepositions, postpositions or affixes (or sometimes by accentuation shifts).


In my example, the RTL reading is indeed a misunderstanding. I even cheated, because it really should have been:

> Left to Right English - read can, who? Anyone with [which] impressed am I.

and the causation is wrong; instead of the ability being impressive, it's the impressive character than allows reading in the opposite order.

So, you're right, and now I'll wait for the dog police to come pick me up.


Yoda, you that is?

But the page is not in a language that should be read right to left, doesn't that make that kind of confusing?

Did you mean "right to left"?

I very much did, it got too confusing even for me. Thanks!

I kept mentally verifying that English is written left to right.

Err, how so?

If one person can do this then thousands should also be able to.

> If one person can do this then thousands should also be able to.

That’s not at all true. Very few people take the time to drive thousands of miles in a short period of time and document it while never intervening for any reason (not even accidentally bumping the wheel).

Anyways, I remember you. You claimed that Tesla would never remove safety drivers from their robotaxis.[1] I tried to get us to bet on this prediction but you never replied.

Well, Tesla has removed safety drivers from some of their robotaxis, meaning your prediction that their technology is “never going to be reliable enough” was falsified within a few months.

1. https://news.ycombinator.com/item?id=45661368


Tesla is on fire, took them 12 years to have few cars drive from point A to point B (not always safe but progress nonetheless)… suprised their P/E is not in the many thousands with this amazing feat they accomplished, well ahead of all of their competitors

> Very few people take the time to drive thousands of miles in a short period of time

Luckily that's not a requirement here. If your 1 example, your 1 datapoint isn't a fluke, then someone driving 100 miles should have 1/10th the interruptions as someone driving 1000 miles.

If you have to intervene 0 times in 1000 miles, and that isn't an outler, then we should expect to see 0 intervention required in all 100 mile drives, all 10 mile drives, and all 1 mile drives. Unless it's a fluke.

Statistics > 1 anecdotal post about a car on the car company CEO's website.


It is true because millions of people own Tesla's with FSD. Moreover if FSD really worked this well Tesla would be happily publishing this data at tesla.com/FSDstats . But instead Tesla is very secretive with their FSD data because it overall isn't very good.

"Tesla has removed safety drivers from some of their robotaxis"

Why doesn't Tesla has as many empty Robotaxis as Waymo has cars? Because Robotaxis aren't good enough. The entire Robotaxi rollout is a very carefully choreographed smoke and mirrors show to inflate Tesla share price. Tesla doesn't even have the permits to operate Robotaxis in California and probably never will because they are actually stricter vs Texas.


> Tesla doesn't even have the permits to operate Robotaxis in California and probably never will because they are actually stricter vs Texas.

I'd like to note that in just a few months, the goalposts have moved from "Tesla will never get rid of their safety drivers in their Robotaxis" to "Tesla will never operate Robotaxis in California."

As before, would you like to bet on your prediction? I'm willing to wager you any amount up to $1,000 that before the end of 2028, Tesla will have Robotaxis in California, available to the public, without a safety driver. Note that this may not require a permit for deployment, just driverless testing, as that is how Waymo currently operates.[1]

December 31st, 2028 may seem like a long ways off, but it's much sooner than never.

1. https://www.dmv.ca.gov/portal/vehicle-industry-services/auto...


"Tesla will never get rid of their safety drivers in their Robotaxis"

They didn't though, they just moved them to a chase car. Because Elon Musk is an incredibly successful con artist. Why isn't every robotaxi driverless and open to the public like Waymo is? Tesla is so far behind Waymo it is laughable.

https://www.reuters.com/business/autos-transportation/musk-t...

Tesla touts California robotaxis but does nothing to get permits

Documenting driverless testing miles is critical for a series of permits

Tesla logged zero miles last year in California for the sixth straight year

Waymo documented 13 million miles over a decade before securing driverless ride-hailing permit


cryptographically signed timestamped raw log data.

If Mythos is really as good as Anthropic says it is there is no way the NSA isn't already using it.

You mean.. their own version?

or that they’ve taken it from Anthropic?

Also: you don’t think the White House is privy to all of the data sources, dashboards and tools its subsidiaries have - do you…?


They either are running it in a NSA DC or have API access.

Legal documents, contracts, and court filings must continue to use Department of Defense because legally the Department of War doesn't exist.

Even just in chats with Opus 4.6 I noticed hitting limits so much faster.

Every company I have ever worked for in the US didn't use IPv6 and actually blocked it at the FW

The US has something like 80% of the world's IPv4 addresses, so they feel a lot less pressure to migrate.

I’ve worked for a company that was barely using its /16. I know several individuals, including myself, with personal /24s.

I recently released a /24 that I registered in 1992 and I hadn’t realized it was still mine.

ARIN was gonna charge me $100 to authenticate and recover the account, but once I asserted and notarized my letter of relinquishment, the process went real quick!


You could've recovered it and sold it for $7K, or rented it out for $500/month.

Dude /24s are worth at least $5,000

A /24 is currently worth between $5,000 and $9,000 USD. Did you get them a long time ago?

Yes, over 32 years ago. It was before ARIN and is considered a legacy block.

With your own /24 you can get an AS number and potentially BGP multi-home it.

Yes, I've already done that. I have the /24 tunneled to my home network.

That would be so cool to see your own AS in the global BGP table. I suppose I could still try this using IPv6

You can get that going pretty cheap through various RIPE LIRs!

None of which are any help when connecting to someone who doesn't have those.

I know, I'm just agreeing there's a ton of IP waste in the US. Early adopters were perhaps unjustly rewarded. InterNIC (before ARIN) would just about hand out IPs to anyone who could send an email.

During the dot-com crash I had to put a /16, some /17's and a /19 on one vlan and connected a 1U Linux box running Labrea Tarpit just so those ranges would respond to ping because InterNIC used to harass us for not utilizing all the space. They threatened a few times to take them back. AFAIK nobody nags like that any more. They probably should.

I worked for a state government agency that had a public /16

US is significantly above average in terms of adoption

Our freaky network admins rolled it out in our global corpo.

Was fun seeing IPv6 running for a few days without problems.


Then any similar regions in latent spaces would also count as CP, right?

It really is only as distressing as you let it be.

Thank goodness middle schoolers have fully developed adult emotional regulation, then.

How do you think kids develop emotional regulation exactly? Do you think it comes from hearing repeated "It's OK" after a child fell?

> How do you think kids develop emotional regulation exactly?

That depends on what they're exposed to, how much of it, and how quickly.

Too much nastiness and you get PTSD and catatonia, like kids in Gaza or child abuse victims. Or suicide.

Children need to experience bad stimuli, for sure. But that doesn't mean they can cope with unlimited really bad stimuli.

> Do you think it comes from hearing repeated "It's OK" after a child fell?

No. But it doesn't come from "you fucking idiot child, you can't even walk properly, you'll never amount to anything" either.


Well then they should be TAUGHT this. Because the technology isn't going away.

I believe you need to be taught that if you're under-developed you literally cannot be taught it yet.

(Alas I guess that might be true for this lesson too!)


The embarrassment the person suffers depends entirely on how they are raised to view nudity and sex.

How many cultures on the planet would not find "a photorealistic depiction of a specific teenager getting gangbanged" to be embarassing to said teenager?

Hopefully ALL of them when it becomes possible to effortlessly create fake images like this. It is the only reasonable long term reaction.

Kids already die from extended verbal abuse.

Why would we assume your scenario will occur with even more effective bullying if it isn't already with that?


how do you teach someone whose brain is still developing?

"hey kids, get used to being exploited sexually, as it would be too expensive to require massive multinational corporations to bother to regulate AI"


Teaching someone when their brain is still developing is the BEST TIME to teach them.

Open models cannot be regulated. The genie is out of the bottle permanently.


This is an almost sociopathic lack of empathy

You can leave the "almost" out. That was an absolutely deranged comment that completely ignores reality.

Expecting people to react to AI generated porn of them like it was real forever is what is actually absolutely derange.

The way people react to things isn't set in stone and people will HAVE to adapt to this technology.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: