Metadata-Version: 2.1
Name: newspaper3k
Version: 0.2.8
Summary: Simplified python article discovery & extraction.
Home-page: https://github.com/codelucas/newspaper/
Author: Lucas Ou-Yang
Author-email: lucasyangpersonal@gmail.com
License: MIT
Platform: UNKNOWN
Classifier: Programming Language :: Python :: 3
Classifier: Natural Language :: English
Classifier: Intended Audience :: Developers
Requires-Dist: beautifulsoup4 (>=4.4.1)
Requires-Dist: Pillow (>=3.3.0)
Requires-Dist: PyYAML (>=3.11)
Requires-Dist: cssselect (>=0.9.2)
Requires-Dist: lxml (>=3.6.0)
Requires-Dist: nltk (>=3.2.1)
Requires-Dist: requests (>=2.10.0)
Requires-Dist: feedparser (>=5.2.1)
Requires-Dist: tldextract (>=2.0.1)
Requires-Dist: feedfinder2 (>=0.0.4)
Requires-Dist: jieba3k (>=0.35.1)
Requires-Dist: python-dateutil (>=2.5.3)
Requires-Dist: tinysegmenter (==0.3)

Newspaper3k: Article scraping & curation
========================================

.. image:: https://badge.fury.io/py/newspaper3k.svg
    :target: http://badge.fury.io/py/newspaper3k.svg
    :alt: Latest version

.. image:: https://travis-ci.org/codelucas/newspaper.svg
    :target: http://travis-ci.org/codelucas/newspaper/
    :alt: Build status

.. image:: https://coveralls.io/repos/github/codelucas/newspaper/badge.svg?branch=master
    :target: https://coveralls.io/github/codelucas/newspaper
    :alt: Coverage status

Inspired by `requests`_ for its simplicity and powered by `lxml`_ for its speed:

    "Newspaper is an amazing python library for extracting & curating articles."
    -- `tweeted by`_ Kenneth Reitz, Author of `requests`_

    "Newspaper delivers Instapaper style article extraction." -- `The Changelog`_

.. _`tweeted by`: https://twitter.com/kennethreitz/status/419520678862548992
.. _`The Changelog`: http://thechangelog.com/newspaper-delivers-instapaper-style-article-extraction/

**Newspaper is a Python3 library**! Or, view our **deprecated and buggy** `Python2 branch`_

.. _`Python2 branch`: https://github.com/codelucas/newspaper/tree/python-2-head

A Glance:
---------

.. code-block:: pycon

    >>> from newspaper import Article
    >>> url = 'http://fox13now.com/2013/12/30/new-year-new-laws-obamacare-pot-guns-and-drones/'
    >>> article = Article(url)

.. code-block:: pycon

    >>> article.download()
    >>> article.html
    '