Diffbot

Turn websites into data in seconds.

Check out Diffbot on GitHub.

Extract Article Text

Inputs

  • URL - the website link you want to pull text from. (E.g. wayscript.com)

Outputs

  • Website Text - The websites text.

  • JSON Data - The raw JSON results from Diffbot.

Diffbot

Data from any Shopping or e-Commerce Product Page

Inputs

  • URL - the e-commerce website link you want to pull text from.

Outputs

  • Product Name

  • Product Brand

  • Product Price

  • SKU Number

  • Description

  • Image URL

  • Product Availability

Forum/Discussion/Product Conversations and Reviews

This mode, pulls lists of all of the posts on the forum page.

Inputs

  • URL - the forum link you want to pull text from.

Outputs

  • Post Title

  • Post Author

  • Post Text

  • Post Author URL

  • Post Date

  • Post Language