Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
hugodutka
3 months ago
|
parent
|
context
|
favorite
| on:
Show HN: Zerox – Document OCR with GPT-mini
I think so. I'd normalize the text first: lowercase it and remove all non-alphanumeric characters. E.g for the phrase "What now?" I'd create these trigrams: wha, hat, atn, tno, now.
Consider applying for YC's W25 batch! Applications are open till Nov 12.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: