Epic 2.16 Presentation Generator #203

irfanariyaz · 2025-03-25T16:58:04Z

Description

Added image generation functionality to the slide generator tool. This enhancement allows dynamic, context-aware image generation for presentation slides using Google's Imagen model and Firebase for image storage. The implementation supports:

Generating images based on slide content and context
Uploading generated images to Firebase
Supporting multiple image styles and templates
Handling image generation for different slide types

Related Issue

This is a feature enhancement for the presentation generator.

Type of Change

Please select the type(s) of change that apply and delete those that do not.

Proposed Solution

Implemented image generation in the slide generator with the following key components:

ImageGenerator class using Vertex AI's Imagen model
Firebase integration for image storage and URL generation
Dynamic image prompt generation based on slide content
Configurable image generation parameters (width, height, aspect ratio)
Parallel image generation for multiple slides

Key modifications:

Added imagen.py for image generation logic
Updated tools.py to include image generation in slide creation workflow
Enhanced generate_slides() method to handle image generation
Added error handling and logging for image generation process

How to Test

Provide instructions on how to test these changes. Include details on test configurations, test cases, and expected outcomes.

Unit Tests

List the unit tests added or modified to verify your changes.

test_executor()
test_generate_slide_image()
test_executor_missing_inputs()
test_validate_slides_content()
test_validate_slides_content_with_garbage()
test_validate_slides_content_empty_slides()
test_slide_generator_compile_context()
test_slide_model()
test_slide_presentation_model()
test_imagen_generate_image()

Documentation Updates

Indicate whether documentation needs to be updated due to this PR.

[] Yes
No

If yes, describe what documentation updates are needed and link to the relevant documentation.

Checklist

I have performed a self-review of my code.
I have commented my code, particularly in hard-to-understand areas.
I have made corresponding changes to the documentation.
My changes generate no new warnings.
I have added tests that prove my fix is effective or that my feature works.
New and existing unit tests pass locally with my changes.
Any dependent changes have been merged and published in downstream modules.

Additional Information

Inputs for outline generator:
{
"user": {
"id": "string",
"fullName": "string",
"email": "string"
},
"type": "tool",
"tool_data": {
"tool_id": "outline-generator",
"inputs": [
{
"name": "topic",
"value": "Lang chain"
},
{

        "name": "instructional_level",
        "value": "graduate"
    },    
    {
        "name": "n_slides",
        "value": 6
    },
    {
        "name": "file_type",
        "value": ""
    },
    {
                                    
        "name": "file_url",
        "value": ""
    },
    {
        "name": "lang",
        "value": "en"
    }
]

}
}

Input for slide generator:
{
"user": {
"id": "string",
"fullName": "string",
"email": "string"
},
"type": "chat",
"tool_data": {
"tool_id": "slide-generator",
"inputs": [
{
"name": "slides_titles",
"value": [
"Introduction to LangChain: A Conceptual Overview",
"LangChain Architecture: Modules & Core Components",
"Advanced LangChain Capabilities: Agents & Memory",
"Real-World LangChain Applications: Case Studies",
"Building Your First LangChain Application: A Practical Example",
"Future Directions and Challenges in LangChain"
]
},{
"name": "topic",
"value": "Lang chain"
},
{

        "name": "instructional_level",
        "value": "graduate"
    },    
    {
        "name": "file_type",
        "value": ""
    },
    {
      
        "name": "file_url",
        "value": ""
    },
    {
        "name": "lang",
        "value": "en"
    }
]

}
}
Add any other information that might be useful for the reviewers.

Loom video:https://www.loom.com/share/5c82751cd6224410a88d7046b88d8d29?sid=12145d25-99d2-406e-b7fb-6e23bf36893b

irfanariyaz · 2025-03-25T22:16:45Z

Generated output

{
"data": [
{
"title": "Introduction to LangChain: A Comprehensive Overview",
"template": "titleAndBody",
"content": "Welcome! This presentation explores LangChain, a powerful framework for developing applications powered by large language models (LLMs). We'll cover its core components, practical applications, and advanced techniques, equipping you with the knowledge to build robust LLM-driven solutions. LangChain simplifies the complexities of LLM integration, enabling efficient development and deployment of sophisticated applications.",
"image_url": null
},
{
"title": "Core Components of LangChain: Modules & Architectures",
"template": "titleAndBullets",
"content": [
"LLMs: Integration with various LLMs (OpenAI, Hugging Face, etc.)",
"Prompts: Techniques for crafting effective prompts for LLMs.",
"Indexes: Structuring and accessing external data for LLMs.",
"Chains: Combining multiple components to create complex workflows.",
"Agents: Enabling LLMs to interact with external tools and APIs.",
"Memory: Maintaining context across multiple interactions with an LLM."
],
"image_url": "https://storage.googleapis.com/marvel-ai-firebase.firebasestorage.app/slides/Lang_chain/slide_1.png"
},
{
"title": "LangChain in Action: Practical Use Cases and Examples",
"template": "titleAndBullets",
"content": [
"Chatbots: Building conversational AI agents.",
"Question Answering Systems: Creating systems that answer questions from various data sources.",
"Summarization: Generating concise summaries of lengthy documents.",
"Data Analysis: Using LLMs for insightful data interpretation.",
"Creative Writing Assistants: Aiding writers with idea generation and text refinement."
],
"image_url": "https://storage.googleapis.com/marvel-ai-firebase.firebasestorage.app/slides/Lang_chain/slide_2.png"
},
{
"title": "Advanced LangChain Techniques: Memory & Agents",
"template": "twoColumn",
"content": {
"leftColumn": "Memory: Explore different memory types (ConversationBufferMemory, ConversationSummaryMemory) and their impact on maintaining context in long conversations. Discuss challenges and best practices for managing context across multiple interactions.",
"rightColumn": "Agents: Examine different agent types (ZeroShotAgent, ToolAgent) and their capabilities. Illustrate how agents can enhance LLM applications by enabling interaction with external tools and APIs. Discuss real-world examples."
},
"image_url": null
},
{
"title": "Building Robust LLM Applications with LangChain",
"template": "titleAndBody",
"content": "This section focuses on best practices for building robust and scalable LLM applications using LangChain. We will discuss topics such as error handling, efficient data management, prompt engineering strategies, and deployment considerations. Real-world examples of successful deployments will be presented.",
"image_url": null
},
{
"title": "Future Trends and Challenges in LangChain",
"template": "titleAndBullets",
"content": [
"Improved Agent Capabilities: More sophisticated agents with enhanced reasoning and decision-making abilities.",
"Enhanced Memory Management: More efficient and scalable memory solutions for complex applications.",
"Integration with other Frameworks: Seamless integration with other AI and machine learning tools.",
"Addressing Ethical Concerns: Developing responsible and ethical LLM applications.",
"Standardization and Interoperability: Establishing standards to ensure compatibility and interoperability."
],
"image_url": null
}
]
}

buriihenry

Since there is a quota limit when generating the images and storing them in Firebase. Is it possible to store them in the Google Cloud Bucket?

irfanariyaz · 2025-04-03T18:31:33Z

i will try storing in the google cloud bucket and update you.

irfanariyaz · 2025-04-05T14:05:17Z

I got the same issue with the GCS too. The error says: 429 Quota exceeded for aiplatform.googleapis.com/online_prediction_requests_per_base_model with base model: imagen-3.0-generate

buriihenry · 2025-04-15T19:48:26Z

app/services/schemas.py

Is it possible not to edit this file? Instead can we have schemas.py file inside the tools/presentation_generator

yes that is possible .i can refractor that

buriihenry · 2025-04-15T19:51:49Z

Dockerfile

There's no need to commit this Dockerfile. It should remain local since it contains configurations and images that shouldn't be pushed to the remote branch

ok i'l remove this.

…az/marvel-ai-backend into presentation-generator

irfanariyaz · 2025-04-17T08:47:47Z

Done the changes mentioned above.

stevenrayhinojosa-gmail-com · 2025-05-04T02:11:36Z

I've thoroughly tested your PR for the presentation generator feature and have identified several issues that need to be addressed before it can be merged.

Current Status
The PR appears to be a work in progress with several incomplete components:

Missing Prompt Files:
The slide generator is looking for a prompt file that doesn't exist: slide_generator_prompt_batch.txt
This causes most of the tests to fail with FileNotFoundError
Import Errors:
The outline generator tests are failing because they're trying to import OutlineGeneratorInput from app.services.schemas, but this class doesn't exist in that module
Test Failures:
The original presentation generator tests are failing with document loading errors
The updated slide generator tests have assertion errors due to mismatched parameters
Incomplete Implementation:
The directory structure for the updated presentation generator is in place, but some implementation files appear to be incomplete
Recommendations
To make this PR ready for merging, I recommend the following steps:

Fix Missing Files:
Create the missing prompt file: app/tools/presentation_generator_updated/slide_generator/prompt/slide_generator_prompt_batch.txt
Ensure all required prompt files are included in the repository
Fix Import Issues:
Update the import paths in the tests to use the correct modules
Ensure all required schema classes are defined in the appropriate modules
Fix Test Assertions:
Update the test assertions to match the actual behavior of the code
In particular, fix the test_imagen_generate_image test to match the actual parameters being used
Complete Implementation:
Finish implementing any incomplete components
Ensure all required functionality is properly implemented and tested
Documentation:
Add documentation explaining the new two-component approach
Include examples of how to use the outline generator and slide generator together
I appreciate the work you've done so far on this feature. The architectural approach of separating the outline generation and slide generation is sound, but the implementation needs to be completed before the PR can be merged.

irfanariyaz added 2 commits March 25, 2025 09:13

updated image_generation functionality

c595ee7

updated

37b4c66

buriihenry self-assigned this Mar 27, 2025

buriihenry self-requested a review March 27, 2025 08:32

buriihenry reviewed Apr 3, 2025

View reviewed changes

irfanariyaz and others added 4 commits April 14, 2025 13:06

added image generator tool

38ccd20

updated image tool generator

cce2939

added tests

4a7c85b

Merge branch 'ai-squad-003' into presentation-generator

462bdf2

buriihenry suggested changes Apr 15, 2025

View reviewed changes

irfanariyaz added 4 commits April 17, 2025 01:38

updated image generator tool

15f8da6

Merge branch 'presentation-generator' of https://github.com/irfanariy…

c61e9b5

…az/marvel-ai-backend into presentation-generator

Merge branch 'presentation-generator' of https://github.com/irfanariy…

338b378

…az/marvel-ai-backend into presentation-generator

Remove Dockerfile from repository

3f1c677

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Epic 2.16 Presentation Generator #203

Epic 2.16 Presentation Generator #203

Uh oh!

irfanariyaz commented Mar 25, 2025

Uh oh!

irfanariyaz commented Mar 25, 2025

Uh oh!

buriihenry left a comment

Uh oh!

irfanariyaz commented Apr 3, 2025

Uh oh!

irfanariyaz commented Apr 5, 2025

Uh oh!

buriihenry Apr 15, 2025

Uh oh!

irfanariyaz Apr 16, 2025

Uh oh!

buriihenry Apr 15, 2025

Uh oh!

irfanariyaz Apr 16, 2025

Uh oh!

irfanariyaz commented Apr 17, 2025

Uh oh!

stevenrayhinojosa-gmail-com commented May 4, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Epic 2.16 Presentation Generator #203

Are you sure you want to change the base?

Epic 2.16 Presentation Generator #203

Uh oh!

Conversation

irfanariyaz commented Mar 25, 2025

Description

Related Issue

Type of Change

Proposed Solution

How to Test

Unit Tests

Documentation Updates

Checklist

Additional Information

Uh oh!

irfanariyaz commented Mar 25, 2025

Uh oh!

buriihenry left a comment

Choose a reason for hiding this comment

Uh oh!

irfanariyaz commented Apr 3, 2025

Uh oh!

irfanariyaz commented Apr 5, 2025

Uh oh!

buriihenry Apr 15, 2025

Choose a reason for hiding this comment

Uh oh!

irfanariyaz Apr 16, 2025

Choose a reason for hiding this comment

Uh oh!

buriihenry Apr 15, 2025

Choose a reason for hiding this comment

Uh oh!

irfanariyaz Apr 16, 2025

Choose a reason for hiding this comment

Uh oh!

irfanariyaz commented Apr 17, 2025

Uh oh!

stevenrayhinojosa-gmail-com commented May 4, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants