开发者

Open source equivalent to opencalais (Preferably PHP or Python?)

开发者 https://www.devze.com 2023-03-25 16:06 出处:网络
Is there an open source equivalent of op开发者_如何学运维encalais - preferably in PHP or Python?Conceptually it\'s an interesting idea. It seems to be parsing basic text or HTML content, then wrapping

Is there an open source equivalent of op开发者_如何学运维encalais - preferably in PHP or Python?


Conceptually it's an interesting idea. It seems to be parsing basic text or HTML content, then wrapping entities in microformats

Why don't you just the web API for their service?

Implementing a solution yourself should be straightforward - it's just a case of lots of regexes and microformat templates.


It seems their main service is Named Entity Recognition with some extrapolations made on top of that. NER is still a research problem in Natural Language Processing. This means there's no one-size-fits-all algorithm, though commercial providers might try to convince you otherwise.

That said, Python's nltk does have some relevant features.


Stanford NER for the Named Entity Recognition and a Dependency parser would be some of the building blocks of making an equivalent to Opencalais.

0

精彩评论

暂无评论...
验证码 换一张
取 消