July 19, 2018autoSmry bot nlp text
1. You may now just enter a URL into the textbox and autoSmry will read that entire page instead of requiring you to copy & paste a webpage’s contents into the textbox (URL must point to a html file).
To all the people who suggested adding the ability to parse the URL directly… this made a huge improvement on ease of use (for me at least). Thanks for the suggestion!
The original limitations still apply however. If the webpage is too long, it will timeout.
Also, while it tries to only read the main content, it does sometimes screw it up. You can check for any odd shapes in “Sentence Relationships”, as any odd shapes will likely indicate that it has unrelated text in there somewhere. The odd shapes may affect the quality of the summary. (Odd shapes to me are anything that resembles a scary insect…). The algorithm is robust enough to recognize and ignore completely unrelated sentences, but if there are a lot of them, it may start incorporating it.
2. I added additional support for other languages.
This one is still experimental in my mind. What I did was apply the same sort of preprocessing that I did for the English version (minus some English-specific ones) to a few more languages. This should theoretically result in much better sentence choices than before.
Testing this however is extremely labourious for me since I only know English. I would appreciate any feedback on how it’s working for the following languages:
If the results aren’t terrible… I’ll look into adding additional language support.
- Enter a URL directly; no more copy/pasting from a webpage
- Too long; didn’t read = trop long; n’a pas lu = zu lang; habe nicht gelesen = demasiado largo; no leí
Demo of updates
Articles tested in the video demo: