Skip to content

💡[Feature]: Pipeline to predict whether given pdf is malicious or not #1425

@DarshAgrawal14

Description

@DarshAgrawal14

Is there an existing issue for this?

  • I have searched the existing issues

Feature Description

I would like to contribute by developing a pipeline that, when provided with a PDF, extracts metadata, content, and other relevant features. These extracted elements are then processed and passed to a model, which predicts whether the PDF is malicious.

Use Case

Figure out PDFs with malware

Benefits

No response

Add ScreenShots

Untitled.video.-.Made.with.Clipchamp.6.mp4

Priority

High

Record

  • I have read the Contributing Guidelines
  • I'm a GSSOC'24 contributor
  • I want to work on this issue

Metadata

Metadata

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions