Skip to content

Conversation

saschaseniuk
Copy link

Cookbook Pull Request

Description

This PR adds a new European Company Data Enrichment notebook that demonstrates how to
use Mistral AI with Serper.dev for enriching company data across European markets.
The notebook provides:

  • Automated web search using Serper API for reliable company data retrieval
  • Multi-language support for European countries (DE, FR, ES, IT)
  • Structured data extraction using Mistral AI function calling
  • Google Colab compatibility with flexible API key handling
  • Batch processing capabilities for multiple companies

Dependencies: Serper.dev API key (free tier available at https://serper.dev)

Type of Change

What type of PR is it?

  • New Cookbook
    • Notebook File
      • Does it work on google colab?
    • Markdown File
  • Cookbook Update
    • Code Refactoring
    • Bug Fix
  • README.md Update

  • Other (please describe):

Cookbook Checklist:

  • My code is easy to read and well structured.
  • I've tagged the versions of any dependency required.
  • I have performed a self-review of my own code.
  • I have commented my code, particularly in hard-to-understand areas.
  • My changes generate no new warnings or errors.

  • My changes do not concern the cookbooks.

README.md Checklist

  • I've added my cookbook to the table.

  • My changes do not concern the README file.

Additional Context

This notebook addresses the need for reliable European company data enrichment by:

  1. Using Serper.dev instead of direct Google Search API to avoid blocking issues
  2. Implementing country-specific search templates for better localization
  3. Providing structured output using Pydantic models and Mistral AI function calling
  4. Including confidence scoring based on data completeness
  5. Supporting both individual and batch processing workflows

The notebook is designed to be production-ready while remaining educational and easy
to understand.

This template accurately reflects your contribution and checks all the appropriate
boxes. You can copy and paste this into your PR description!

│                                                                                          │
│   - New notebook for enriching European company data using Mistral AI and Serper web     │
│   search                                                                                 │
│   - Supports multiple European languages (DE, FR, ES, IT) with localized search          │
│   queries                                                                                │
│   - Includes Google Colab compatibility and structured data extraction                   │
│   - Added entry to Third Party Tools section in README
@ravi03071991
Copy link
Collaborator

Hello @saschaseniuk ,

Thanks for the PR. Here are a few comments:
1. Please add a Google Colab link at the start. You can refer to this notebook for guidance.
2. Please add more details about the use case at the beginning to help users better understand it.
3. Could you move the setup instructions to the top? This will help users understand how to obtain Mistral and Serper API keys.
5. Please use the latest Mistral client. ChatMessage and MistralClient have been updated in the latest version.
6. I believe this use case can be solved using the StructuredOutput mechanism instead of the function-calling approach. Please check the docs and see if you’d prefer to update the implementation.
7. Run the cookbook and show the outputs of each cell so that users can get a better sense of how things are working.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants