Free Simple Keyword Parser

Today, I have a small task require a HTML Keyword Parser, the objective is to produce the meaningful keywords from HTML page, but I'm just done it with a very simple version. This very basic version just cut the long sentence into keywords, count how many time it appears and yes, calculate the keyword's density. As I told you, this is the simple version, but I think it suitable for who needs to count, calculate density of each SINGLE keyword in an article, paragraph.

Let me show you how it works, then you can try my Simple HTML Keyword Parser for free :-)

How the Simple HTML Keyword Parser works

Very simple, in this basic version, I just break down the sentence by any non-alphanumeric character to have a list of potential keywords. Then I loop through those potential keywords and check if the keywords is numeric (whole keyword is only contains digits) move it out, after this loop, we will have a list of "more potential" keywords.

In my opinion, those keywords is "potential" enough to return to you, if you think I miss some other surely-non-keyword, please comment for me, I will put it soon. Later (maybe tomorrow or the day later), I will add the paging and sort for shortest/longest keywords, count and density, I think it will make our life easier.

What's next

I think I will take the next step after a few days, make the parser more intelligent, know to strip out HTML tags and parse text only. Currently, with the basic, simple and stupid version, you only can put a sentence, paragraph to parse for keywords, if you put into it the whole HTML page, it will also treats HTML Tag Name as keywords, it's not so good. I think a dedicate parser for HTML will be better.

OK, Talk enough, go and check it out

Yah, I talk so much about it, why not try it yourself? Goes to my Free Simple Keyword Parser, a basic, simple, a little stupid but useful (at least for me) Keyword Parser. Feel free to give me your comment, love to hear from you.