
Conversation

pranjalg1331
Contributor

(Please add to the PR name the issue(s) that this PR would close if merged, using a GitHub keyword. Example: <feature name>. Closes #999. If your PR is made of a single commit, please add that clause to the commit message too. This is all required to automate the closure of related issues.)

Description

Please include a summary of the change and link to the related issue.

Type of change

Please delete options that are not relevant.

  • Bug fix (non-breaking change which fixes an issue).
  • [x] New feature (non-breaking change which adds functionality).
  • Breaking change (fix or feature that would cause existing functionality to not work as expected).

Checklist

  • [x] I have read and understood the rules about how to Contribute to this project
  • [x] The pull request is for the branch develop
  • A new plugin (analyzer, connector, visualizer, playbook, pivot or ingestor) was added or changed, in which case:
    • I strictly followed the documentation "How to create a Plugin"
    • Usage file was updated. A link to the PR to the docs repo has been added as a comment here.
    • Advanced-Usage was updated (in case the plugin provides additional optional configuration). A link to the PR to the docs repo has been added as a comment here.
    • I have dumped the configuration from Django Admin using the dumpplugin command and added it in the project as a data migration. ("How to share a plugin with the community")
    • If a File analyzer was added and it supports a mimetype which is not already supported, you added a sample of that type inside the archive test_files.zip and you added the default tests for that mimetype in test_classes.py.
    • If you created a new analyzer and it is free (does not require any API key), please add it in the FREE_TO_USE_ANALYZERS playbook by following this guide.
    • Check if it could make sense to add that analyzer/connector to other freely available playbooks.
    • I have provided the resulting raw JSON of a finished analysis and a screenshot of the results.
    • If the plugin interacts with an external service, I have created an attribute called precisely url that contains this information. This is required for Health Checks (HEAD HTTP requests).
    • If the plugin requires mocked testing, _monkeypatch() was used in its class to apply the necessary decorators.
    • I have added that raw JSON sample to the MockUpResponse of the _monkeypatch() method. This provides a valid sample for testing.
    • I have created the corresponding DataModel for the new analyzer following the documentation
  • I have inserted the copyright banner at the start of the file: # This file is a part of IntelOwl https://github.com/intelowlproject/IntelOwl # See the file 'LICENSE' for copying permission.
  • Please avoid adding new libraries as requirements whenever possible. Use new libraries only if strictly needed to solve the issue you are working on. In case of doubt, ask a maintainer for permission to use a specific library.
  • If external libraries/packages with restrictive licenses were added, they were added in the Legal Notice section.
  • Linters (Black, Flake8, Isort) gave 0 errors. If you have correctly installed pre-commit, it performs these checks and adjustments on your behalf.
  • [x] I have added tests for the feature/bug I solved (see tests folder). All the tests (new and old ones) gave 0 errors.
  • If the GUI has been modified:
    • I have provided a screenshot of the result in the PR.
    • I have created new frontend tests for the new component or updated existing ones.
  • After submitting the PR, if DeepSource, Django Doctors or other third-party linters triggered any alerts during the CI checks, I have solved those alerts.

Important Rules

  • If you fail to complete the Checklist properly, your PR won't be reviewed by the maintainers.
  • Every time you make changes to the PR and you think the work is done, you should explicitly ask for a review by using GitHub's reviewing system detailed here.

@pranjalg1331
Contributor Author

Hello mentors, I hope you’re all doing well.
In this PR, as discussed in the proposal, I have implemented a parent test class that runs unittest subtests for all supported analyzer observable types. I also updated test_nvd_cve to work with this new base test class. Could you please review it before I extend this approach to the other analyzers?

[screenshot attached]

Contributor

@fgibertoni fgibertoni left a comment


Thanks for your work!
I like the general approach you're proposing here. My only consideration is about the analyzer_mocks.py file: I like having a centralized place to hold the configurations, but I think each mock should be related to one or two classes at most. So I don't see the benefit of having a separate file and a separate dictionary to hold them, which would have to be updated every time a new analyzer/plugin is created. This also implies that for every plugin there will be a similar file, right?

I would also like to hear what you think about this, and I'm asking @mlodic and @drosetti for an opinion.

@mlodic
Member

mlodic commented Jun 7, 2025

Thank you for asking for an early review. I agree with Federico. The general approach is really good and I love it. I think that the mocks should stay inside the related analyzer files. An idea is to make BaseAnalyzerTest an ABC and add an abstract property, containing the patch, that must be declared by each analyzer test class.
In this way, in case some different analyzers use the same patch, you could use another level of inheritance to make it re-usable between different instances.

@pranjalg1331
Contributor Author

Thanks for the feedback! I tried using an ABC with abstract properties and shared test methods, but Django's test discovery tries to instantiate all TestCase subclasses, even abstract ones. This causes a TypeError when abstract properties aren't implemented, so I cannot define a base test inside an abstract class.

@fgibertoni
Contributor

Can you post a little example of how you did the implementation? So we can help you better 😄

@pranjalg1331
Contributor Author

[screenshot: base class implementation]
Currently, I have implemented the base class without using ABC. The analyzer's mocked data lives in the same file as the analyzer test class. The base class has a get_mocked_response() method that must be overridden by every test class that inherits from it.
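
For orientation, a minimal sketch of this pattern, assuming Django's standard unittest runner. analyzer_class, mock_patch_key, get_mocked_response() and the skipTest guard are the names used in this thread; the observable list and the patching details are hypothetical:

from unittest.mock import patch

from django.test import TestCase


class BaseAnalyzerTest(TestCase):
    analyzer_class = None  # each concrete test class sets the analyzer under test
    mock_patch_key = None  # dotted path to patch, e.g. "requests.get" (hypothetical)

    @classmethod
    def get_mocked_response(cls):
        # Must be overridden by every test class that inherits from this one.
        raise NotImplementedError

    def test_all_observable_types(self):
        if self.analyzer_class is None:
            self.skipTest("analyzer_class is not set")  # skips on the bare base class
        for observable_type in ("ip", "domain", "url", "hash"):  # hypothetical list
            with self.subTest(observable_type=observable_type):
                # patch() forwards return_value to the MagicMock it creates
                with patch(self.mock_patch_key, return_value=self.get_mocked_response()):
                    pass  # run the analyzer against a sample observable of this type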

@drosetti
Contributor

Thanks for the feedback! I tried using an ABC with abstract properties and shared test methods, but Django’s test discovery tries to instantiate all TestCase subclasses—even abstract ones. This causes a TypeError when abstract properties aren’t implemented (So I cannot define a base test inside an abstract class).

The auto-discovery should only look for files named test_*.py; if you put the abstract classes into files with different names, they shouldn't run.

@pranjalg1331
Contributor Author

Yes, @drosetti. The base test case is not directly discovered in base_test_class.py, but when another class inherits from it, the base test case is also run for the base class itself.
Thus, with an ABC we get an error like this:
TypeError: Can't instantiate abstract class BaseAnalyzerTest with abstract methods analyzer_class, mock_patch_key
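
For context, the failing ABC variant looks roughly like this; a sketch reconstructed from the error above, not the actual code from the PR:

import abc

from django.test import TestCase


class BaseAnalyzerTest(TestCase, abc.ABC):
    @property
    @abc.abstractmethod
    def analyzer_class(self): ...

    @property
    @abc.abstractmethod
    def mock_patch_key(self): ...

    def test_all_observable_types(self):
        ...  # shared subtest logic

# As soon as a test_*.py module imports or subclasses BaseAnalyzerTest, the
# unittest loader collects it as a TestCase and tries to instantiate it, raising:
# TypeError: Can't instantiate abstract class BaseAnalyzerTest with abstract
# methods analyzer_class, mock_patch_key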

@pranjalg1331
Contributor Author

Hello mentors,

Over the past few days, I’ve been exploring the possibility of defining a common base structure for unit testing file-based analyzers. However, I’ve noticed that these analyzers vary significantly — some are Docker-based, while others like Androguard don’t rely on mock data at all.

Given this diversity, I’m finding it challenging to apply a uniform strategy across all of them. I’d appreciate your thoughts on whether we should still aim for a shared base test structure (similar to what we have for observable analyzers), or if it would be more practical to focus on writing well-structured, analyzer-specific unit tests for file_analyzers.

Looking forward to your guidance. 😄

@AnshSinghal
Contributor

Hi @pranjalg1331 ,

I had previously explored this task a bit and just wanted to share a quick suggestion in case it helps.
How about a base test class, say BaseFileAnalyzerTest, that handles common setup like creating test jobs and files? Then, each analyzer can have its own test class inheriting from this base, adding specific mocks for Docker or skipping mocks for analyzers like Androguard. Since analyzers support different file types, the base class could parameterize the file type or keep a list of supported types per analyzer, testing each type with _analyze_sample.
Totally appreciate your work on this — feel free to ignore if you’ve already considered it or have a better approach!
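
A rough sketch of this suggestion, assuming Django's test framework; apart from BaseFileAnalyzerTest and _analyze_sample, which are quoted from the comment above, every name here is hypothetical:

from django.test import TestCase


class BaseFileAnalyzerTest(TestCase):
    analyzer_class = None
    supported_mimetypes = ()  # each analyzer's test lists the file types it covers

    def _analyze_sample(self, mimetype):
        # Concrete tests load a sample of this mimetype and run the analyzer,
        # adding Docker-specific mocks or skipping mocks (e.g. Androguard) as needed.
        raise NotImplementedError

    def test_supported_mimetypes(self):
        if self.analyzer_class is None:
            self.skipTest("analyzer_class is not set")
        for mimetype in self.supported_mimetypes:
            with self.subTest(mimetype=mimetype):
                self._analyze_sample(mimetype)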

@fgibertoni
Contributor

Hello @pranjalg1331,
I think that the approach suggested by @AnshSinghal should be the way to go. A common class for all file analyzers and then the specific classes for each analyzer.
You can also extend the base BaseFileAnalyzerTest with different subclasses, e.g. an XXXFileAnalyzerTest for each "classic" file analyzer and an XXXDockerFileAnalyzerTest that contains specific code for Docker-based analyzers. The analyzers that don't rely on mocks should also be treated specially, using generic testing even without mocks; see the sketch after this message.
What do you guys think? @mlodic @drosetti

Also, let me know if you have any other doubts on this 😃
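
As a sketch, the proposed hierarchy could look like this; all names besides BaseFileAnalyzerTest are illustrative:

from django.test import TestCase


class BaseFileAnalyzerTest(TestCase):
    """Common setup shared by every file-analyzer test; "classic" analyzers inherit this directly."""


class DockerBaseFileAnalyzerTest(BaseFileAnalyzerTest):
    """Adds the Docker-specific mocking; Docker-based analyzer tests inherit this instead."""


class AndroguardAnalyzerTest(BaseFileAnalyzerTest):
    """An analyzer that relies on no mocks uses only the generic checks from the base."""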

@pranjalg1331
Contributor Author

Hello @fgibertoni
I’m a bit hesitant to introduce jobs into our unit tests, because that would move them away from “pure” unit testing and would largely replicate what we already cover elsewhere. 
I’ve also noticed that some analyzers rely on mocked responses, while others—such as APKID and BoxJS—don’t appear to use any. Given that the testing logic varies so much across analyzers with no mocked responses, I’m not sure a common base test class would offer many advantages.

Could you shed some light on why APKID and BoxJS don’t include mocked data? Is that a deliberate choice, or simply an area we haven’t addressed yet?

@mlodic
Member

mlodic commented Jun 19, 2025

Could you shed some light on why APKID and BoxJS don’t include mocked data? Is that a deliberate choice, or simply an area we haven’t addressed yet?

Mocked data was introduced at a later point and those are old analyzers; that's the reason. Ideally, all analyzers should have a decent mock.

@mlodic
Member

mlodic commented Jun 19, 2025

I’m a bit hesitant to introduce jobs into our unit tests, because that would move them away from “pure” unit testing and would largely replicate what we already cover elsewhere. 

It makes sense to me

Given this diversity, I’m finding it challenging to apply a uniform strategy across all of them. I’d appreciate your thoughts on whether we should still aim for a shared base test structure (similar to what we have for observable analyzers), or if it would be more practical to focus on writing well-structured, analyzer-specific unit tests for file_analyzers.

To me, it makes sense to use common strategies only for the analyzers that are similar to each other. Yes, there are differences, but they are not many. I think you can start low and easy, implementing the tests for each analyzer, and when you find two of them that are different from all the others but similar to each other, create a structure for them.

What Federico suggested (e.g. an XXXFileAnalyzerTest for each "classic" file analyzer and an XXXDockerFileAnalyzerTest that contains specific code for Docker-based analyzers) could be an idea. The important thing is to avoid repetition and keep the code clean. When we/you see repetitive patterns, create an additional structure to handle them.

@pranjalg1331
Contributor Author

Hello @fgibertoni,

I've implemented a base class for file analyzers that supports both Docker-based and non-Docker analyzers. It works by loading sample files based on mimetypes and mocking any external dependencies. I've also written unit tests for a few analyzers for you to review.

I'd appreciate it if you could take a look at the implementation and share your thoughts—especially on the maintainability of the current structure—before I begin scaling it to all of the analyzers.

Thank you!
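
Piecing together the identifiers that appear in the review excerpts below (MIMETYPE_TO_FILENAME, get_sample_file_bytes, get_extra_config, test_analyzer_on_supported_filetypes), the base class presumably looks something like this. This is a reconstruction for orientation, not the merged code:

from django.test import TestCase


class BaseFileAnalyzerTest(TestCase):
    analyzer_class = None
    MIMETYPE_TO_FILENAME = {}  # maps each supported mimetype to a sample file

    @classmethod
    def get_supported_mimetypes(cls):
        return set(cls.MIMETYPE_TO_FILENAME.keys())

    @classmethod
    def get_extra_config(cls) -> dict:
        return {}  # per-analyzer runtime configuration overrides

    def get_sample_file_bytes(self, mimetype) -> bytes:
        # Loads the sample mapped in MIMETYPE_TO_FILENAME; raises ValueError or
        # OSError when no sample is available for this mimetype.
        raise NotImplementedError

    def test_analyzer_on_supported_filetypes(self):
        if self.analyzer_class is None:
            self.skipTest(f"analyzer_class is not set ({type(self).__name__})")
        for mimetype in self.get_supported_mimetypes():
            with self.subTest(mimetype=mimetype):
                try:
                    file_bytes = self.get_sample_file_bytes(mimetype)
                except (ValueError, OSError):
                    continue  # no sample for this mimetype
                assert isinstance(file_bytes, bytes)  # run the analyzer here, externals mocked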

Contributor

@fgibertoni fgibertoni left a comment


Hello @pranjalg1331,
I like the general approach that you followed. Great work!
I also think that some new users may find it a bit tricky if they have no experience with IntelOwl, so it will certainly require some documentation. But at the moment I can't think of any way to improve it.
Let's hear also from @mlodic and @drosetti if they have some other suggestions 😄

Member

@mlodic mlodic left a comment


seems good work :)


def test_analyzer_on_supported_filetypes(self):
    if self.analyzer_class is None:
        self.skipTest("analyzer_class is not set")
Member


Print the test name so that we can track where the problem is when this triggers.

Member


can you please address this comment?

Contributor Author


updated

config = AnalyzerConfig.objects.get(
    python_module=self.analyzer_class.python_module
)
print(config)
Member


Leftover; remove it or use logging.debug.

    file_bytes = self.get_sample_file_bytes(mimetype)
except (ValueError, OSError):
    print(f"SKIPPING {mimetype}")
    continue
Member


Specify the test case in the log message and use logging.

@pranjalg1331 pranjalg1331 force-pushed the gsoc25_refactor_analyzer_tests branch from 0637a79 to e1b2a66 on July 15, 2025 18:17
Contributor

@code-review-doctor code-review-doctor bot left a comment


Looks good. Worth considering though. View full project report here.

        return set(cls.MIMETYPE_TO_FILENAME.keys())

    @classmethod
    def get_extra_config(self) -> dict:
Contributor


Suggested change:
-    def get_extra_config(self) -> dict:
+    def get_extra_config(cls) -> dict:

Class methods should take cls as the first argument. More info.

        raise NotImplementedError

    @classmethod
    def _apply_patches(self, patches):
Contributor


Suggested change:
-    def _apply_patches(self, patches):
+    def _apply_patches(cls, patches):

Likewise, consider using cls instead.


gitguardian bot commented Jul 26, 2025

✅ There are no secrets present in this pull request anymore.

If these secrets were true positives and are still valid, we highly recommend that you revoke them.
While these secrets were previously flagged, we no longer have a reference to the specific commits where they were detected. Once a secret has been leaked into a git repository, you should consider it compromised, even if it was deleted immediately. Find more information about risks here.


🦉 GitGuardian detects secrets in your source code to help developers and security teams secure the modern development process. You are seeing this because you or someone else with access to this repository has authorized GitGuardian to scan your pull request.

Contributor

@code-review-doctor code-review-doctor bot left a comment


Looks good. Worth considering though. View full project report here.

    fingerprint_report_mode: int = 2

    def run(self):
        reports = dict()
Contributor


Suggested change:
-        reports = dict()
+        reports = {}

Using dict literal syntax is simpler and computationally quicker. More details.

@pranjalg1331 pranjalg1331 force-pushed the gsoc25_refactor_analyzer_tests branch from 7da6181 to 1f6d8e7 on July 26, 2025 08:15
Contributor

@code-review-doctor code-review-doctor bot left a comment


Some things to consider. View full project report here.

    fingerprint_report_mode: int = 2

    def run(self):
        reports = dict()
Contributor


Suggested change:
-        reports = dict()
+        reports = {}

Using dict literal syntax is simpler and computationally quicker. Explained here.

Contributor

@code-review-doctor code-review-doctor bot left a comment


Some food for thought. View full project report here.

            raise AnalyzerRunException(f"{self.name} An unexpected error occurred: {e}")

    @classmethod
    def update(self):
Contributor


Suggested change:
-    def update(self):
+    def update(cls):

Class methods should take cls as the first argument. Explained here.

@pranjalg1331 pranjalg1331 force-pushed the gsoc25_refactor_analyzer_tests branch from 09aa2c7 to 5d980ff on August 27, 2025 09:11
@drosetti drosetti merged commit 1bbf2a7 into develop Sep 4, 2025
11 checks passed
@github-project-automation github-project-automation bot moved this from In Progress to Done in GSoC 2025 Refactor Analyzer Tests Sep 4, 2025
@drosetti drosetti deleted the gsoc25_refactor_analyzer_tests branch September 4, 2025 17:49