Hacker News .hnnew | past | comments | ask | show | jobs | submit | decuran's commentslogin

Can you extract from wiktionary with this?


The answer is complicated (I am mentioning it in the midst of plans and possibilities in README: "(maybe?) other MediaWiki-based sites (like Wikvoyage, Wiktionary, etc.)").

In more details:

1. With the current library, you can't (connection to en.wikipedia.org is hardcoded :shrug:)

2. Adding "what MediaWiki instance to connect" is relatively easy (they all have the same API), including Wiktionary

3. For Wiktionary, current (v. 0.0.2) selectors will be useful, but future ones (infobox, wikitable and other high-level Wikipedia-specific stuff) will be not

4. So I might think of some way of specifying "set of selectors relevant for this instance" or something like that.

5. So, in some distance future, Wiktionary definitely would be fully supported; how distant—probably depends on how fast the project will be moving forward, and how it will be (hopefully) used by others

6. But I might add a support for "use this instance of MediaWiki" in the near future, it is easy!


I love this! I had no idea about the Etymological Wordnet and it probably would have saved me a ton of time developing my app for finding "interesting" cognates: https://etymologyexplorer.com

I've always loved the same thing—finding hidden connections between everyday words. I recently did this with "vain". It comes from Latin vanus, meaning "empty". More obvious with the "in vain" meaning, but the modern day comes from the idea of an exaggerated self image, with no substance behind it. It has a ton of "empty" cognates: vanish, evanescence, vanity (table), vaunt, vacuous, vacuum, vacation, void, devastate, wanton, wane


> It comes from Latin vanus, meaning "empty". More obvious with the "in vain" meaning

Fun fact: the Latin word for "in vain" is frustra.


I'm pretty sure that you can't have devastate in they list without mentioning waste.


If people are interested in diving deeper into etymologies, I made an app that allows you to visually explore all the roots, descendants, or relatives of a word: https://etymologyexplorer.com


Why do you think of etymonline.com ?


Yes, this is the site that inspired me! I was frustrated that it was so hard to navigate to cognates (word cousins) or from roots to descendants. So mine is focused on navigating between words like that


I think he tried to add cognates on some words but I never really caught up with that.


Looks cool! May use this for my toddler


I'm making a tool for exploring word etymologies.

I'm fascinated by the hidden connections in our language—like "galaxy" being connected to "galactose" because of the Milky Way.

I used NLP to convert written etymology paragraphs from wiktionary into a database of words and connections.

Also interesting are the Proto-Indo-European roots, like "h₂enh₁", which is a 10k-year-old reconstructed word meaning "breath", that has about 1k modern day descendants ranging from "animate" and "unanimous" to "anemone".

https://www.etymologyexplorer.com



Yes, this is the site that inspired me! I was frustrated that it was so hard to navigate to cognates (word cousins) or from roots to descendants. So mine is focused on navigating between words like that


Wow, I have been thinking lately about building something similar. Looks very nice!


Thanks so much! Would love any other feedback you have. I'm making updates often


I've been working on a tool that lets you see connections between words in your own language and one you're learning. It is based on following etymology trees up and back down. It has helped me a lot and I enjoy browsing around in it.

For example you can see the connection between Spanish 'mano' ("hand") and English 'manufacture', 'manipulate', etc.

https://www.etymologyexplorer.com


I made this because I'm really interested in how similar words relate to one another. I used NLP to convert written etymology paragraphs from wiktionary into a database of words and connections.

The most interesting thing has been finding Proto-Indo-European roots, like "h₂enh₁", which is a 10k-year-old reconstructed word meaning "breath", that has about 1k modern day descendants ranging from "nose" to "anemone".


deus ex ATM


Is there a repo to reproduce the results? I didn't see anything in the article


That's interesting about Xanthippe and the fancy-fancy horse names. I had recently learned about the etymology of Hippopotamus (water horse) so I was excited to see another use.

As a plug, I'm really into etymologies as well and made EtymologyExplorer for visually navigating roots and descendants https://www.etymologyexplorer.com/


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: