Search

Zyte

Zyte

The Zyte connector allows you to seamlessly pull data from the web using Zyte's powerful web scraping API into Datagrid. This integration enables you to leverage Datagrid's data analysis and visualization tools with web-scraped data, providing enhanced insights and reporting capabilities for your data extraction needs.

1. How-to

1. Prerequisites

To configure the Zyte connector, follow these steps:

  • An active Zyte account with access to the Zyte API.
  • A Zyte API key. You can find this in your Zyte account dashboard.
  • Select the data you want to import into Datagrid.

2. Connect

Creating a dataset from the Zyte connector involves configuring the web scraping parameters and selecting the data you want to extract:

  1. Connect Zyte App: a. Click on the "+ Create” Button on the top left of the screen. b. Select the "Connect Apps" item. c. Search for the Zyte connector from the list. d. Enter your Zyte API key. e. Configure the Zyte API parameters, such as the target website URL, the desired data extraction rules, and any necessary request headers or cookies. f. Click on the “Next” button.
  2. Pick your Data: a. Define the data fields you want to extract from the target website using Zyte's data extraction schemas. b. Pick the Zyte data you want to include in your dataset (e.g., product names, prices, descriptions). c. Click the “Start First Import” Button to start syncing your Zyte dataset.

3. Set Up a Schedule

Scheduling regular data pulls ensures your Datagrid datasets remain up-to-date with the latest information from the web:

  1. Navigate to Zyte Dataset: a. Go to the left side panel and locate and click on the Zyte dataset you created.
  2. Schedule Settings: a. Click on the “...” on the top right of the dataset. b. Click on “Edit Pipeline” to edit your connector's name. c. Click the “Schedule” button on the right, beside the “Import Configuration” button.
  3. Configure Schedule: a. Set the desired frequency for data pulls (e.g., daily, weekly, monthly). b. Specify the time of day for the data pull to occur. c. Specify downtime if needed – when the sync should not happen. d. Click the “Update” button to update the new configuration.

2. Data Access

  • Product Data (names, prices, descriptions, images)
  • Article Data (titles, authors, content, publication dates)
  • Real Estate Data (addresses, prices, property features)
  • Job Postings (titles, descriptions, locations, salaries)
  • Reviews (ratings, comments, dates)
  • Any other data that can be extracted from a website using Zyte's web scraping API

3. Use Cases

  • E-commerce Price Monitoring: Use Datagrid to track product prices across multiple e-commerce websites and identify pricing trends.
  • Market Research: Extract and analyze data from industry websites, news articles, and social media to gain insights into market trends and competitive landscapes.
  • Lead Generation: Scrape contact information from business directories and company websites to generate leads for sales and marketing campaigns.
  • Content Aggregation: Aggregate content from multiple websites into a single Datagrid dataset for content curation and analysis.
  • Brand Monitoring: Track mentions of your brand across the web and social media to monitor brand reputation and identify potential issues.

4. FAQ

Q: What types of data can I import from Zyte into Datagrid?

  • A: You can import any data that can be extracted from a website using Zyte's web scraping API.

Q: How often can I schedule data pulls from Zyte?

  • A: You can schedule data pulls daily, weekly, or monthly, depending on your needs.

Q: What permissions are required to connect Zyte to Datagrid?

  • A: You need an active Zyte account with the necessary permissions to access the Zyte API. Ensure you have the correct API key.

5. Support & Additional Resources