Parsing downloaded HTML content

marian.kocman · November 7, 2023, 8:33am

Hello, integrators!

I would like to create a watchdog for monitoring prices on predefined e-shops. I’m using a REST API connector to download the content of the HTML pages, and it’s working perfectly.

However, I’m encountering difficulties when it comes to extracting the name and price using JS Mapper. Could you provide me with some hints or guidance on this matter?

For example, I want to download the name and the price from this URL:

The price is located in the folloving DIV marked with m-price__price class.

How can I effectively get it?

Thank you!

tomas · November 7, 2023, 7:16pm

Hi Marian,

do you mean something like this: node-html-parser - npm?

Tomas.

marian.kocman · November 8, 2023, 6:04am

Exactly. Unfortunately, I think I cannot use this library in a JS mapper.

tomas · November 8, 2023, 12:27pm

This is available in Node.JS processor only. It is available in Integray Premium license. In current version we can add 3rd party modules using special workaround, however in future release we will have possibility to add modules for Node.js and Python services directly from application.

libor.zoubek · November 9, 2023, 12:19pm

Hi Marian,

In case of standart license or trial environment, you may use regular expressions to find relevant content in HTML page. It’s quite far from ideal, but possible solution.

Either way, scrapping HTML pages will likely lead to errors on unexpected result, because you do not control 3rd party web pages and those pages may change anytime.

marian.kocman · November 9, 2023, 12:21pm

Thank you, Libor, I’ll use the Regex!

system · November 16, 2023, 12:21pm

This topic was automatically closed 7 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Reading data from public api Platform discussions endpoint	2	119	October 17, 2023
HTML Connector JSON input parsing Platform discussions task , connector	3	129	October 27, 2023
Render HTML website as task/endpoint output Platform discussions	5	117	June 22, 2023
How to parse monday.com data Handy solutions js-mapper , mondaycom , json-parser	0	80	July 26, 2023
Introducing Praded (2023.05.001) release - The Next Leap in Your Integration Journey Announcements	5	102	March 11, 2024

Parsing downloaded HTML content

Related topics