Added rules regarding metaprogramming detection in yara #7425

ewjoachim · 2020-02-23T01:30:06Z

Without those additions, accessing eval/exec without mentioning either "eval" or "exec", and without breaking any existing rule, is as simple as:

vars(__builtin__)[dir(__builtin__)[102]]("""malicious_code()""")
# or
import builtins
builtins.__dict__["lave"[::-1]]("""malicious_code()""")

Note: even with these rules, we can still bypass the rules in a number of ways (let me know if it's acceptable to start listing them publicly :) )
(Edit: the more I think about it, the more I find ways :D )
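
To illustrate why the reversed-string trick gets past pattern matching, here is a minimal sketch. `NAIVE_RULE` is a hypothetical stand-in for a text-based rule, not one of the actual YARA rules in this PR, and `is_flagged` is an illustrative helper.

```python
import builtins
import re

# Hypothetical text-based rule (an assumption, not the real YARA rule):
# flag any literal call to eval( or exec(.
NAIVE_RULE = re.compile(r"\b(eval|exec)\s*\(")

direct = 'eval("""malicious_code()""")'
obfuscated = (
    "import builtins\n"
    'builtins.__dict__["lave"[::-1]]("""malicious_code()""")'
)


def is_flagged(source):
    """Return True if the naive rule matches anywhere in the source text."""
    return NAIVE_RULE.search(source) is not None


# The reversed literal resolves to the real builtin at runtime,
# even though the text "eval" never appears in the obfuscated source.
resolves_to_eval = builtins.__dict__["lave"[::-1]] is eval

print(is_flagged(direct), is_flagged(obfuscated), resolves_to_eval)
```

The direct call is flagged, the obfuscated one is not, and both end up calling the same function.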

xmunoz · 2020-04-05T00:19:23Z

This seems reasonable. @ewjoachim, can you add a test or two for these new rules? Look here for examples of how to do that:
https://github.com/pypa/warehouse/blob/master/tests/unit/malware/checks/setup_patterns/test_check.py

ewjoachim · 2020-04-05T08:14:32Z

Seeing what's done, I'm writing a very simple detection test for all the rules, so that we can check all of the regexes there. This, once again, shows how easy it is to circumvent the whole thing. I think it should really be checked with AST rather than regexes. Do you think a rewrite with astpath (or even bellybutton) has a chance of being merged (in another PR, of course)?
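
As a rough idea of what an AST-based check could look like, here is a sketch using only the standard-library `ast` module rather than astpath or bellybutton. The watch lists and the `find_suspicious` helper are assumptions made for illustration; they are not taken from the Warehouse rules.

```python
import ast

# Assumed watch lists for illustration only; not taken from the Warehouse rules.
SUSPICIOUS_CALLS = {"eval", "exec"}
SUSPICIOUS_ATTRS = {"__dict__", "__getattr__", "__getattribute__"}


def find_suspicious(source):
    """Return a sorted list of human-readable findings for `source`."""
    findings = set()
    for node in ast.walk(ast.parse(source)):
        if (
            isinstance(node, ast.Call)
            and isinstance(node.func, ast.Name)
            and node.func.id in SUSPICIOUS_CALLS
        ):
            # Direct call such as eval(...) or exec(...)
            findings.add(f"line {node.lineno}: call to {node.func.id}()")
        elif isinstance(node, ast.Attribute) and node.attr in SUSPICIOUS_ATTRS:
            # Attribute access commonly used to reach builtins indirectly
            findings.add(f"line {node.lineno}: access to .{node.attr}")
    return sorted(findings)


print(find_suspicious('eval("1 + 1")'))
print(find_suspicious('import builtins\nbuiltins.__dict__["lave"[::-1]]("x")'))
```

This catches the `builtins.__dict__` variant through the attribute access, but a lookup routed through `vars(...)` would still go unnoticed, so a structural check narrows the gap without closing it.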

ewjoachim · 2020-04-05T08:28:42Z

Here you are, tests for every rule :)

xmunoz · 2020-04-05T16:26:00Z

I agree that AST is the way forward for this stuff. Without some more advanced parsing, we end up with issues like this: #7475

Another security researcher has done some work on Python AST parsing for malware detection and described it here:
https://medium.com/@bertusk/detecting-cyber-attacks-in-the-python-package-index-pypi-61ab2b585c67

Feel free to open an issue and work on that. I'd be happy to provide a review.

xmunoz

Tests look good, thanks for adding them :)

Make sure to get everything passing and then we should be good for a merge.

xmunoz · 2020-04-05T16:30:29Z

@woodruffw do you have anything to add here?

woodruffw · 2020-04-05T17:19:11Z

Thanks for the ping!

> do you have anything to add here?

I agree completely that an AST-based matcher is the way to go -- we should treat the current YARA check as a proof of concept that's easy to circumvent.

Another (more complex) option here is a mocked Python interpreter with models for potentially malicious functions. Running setup.py under this would allow us to record the calls without needing to worry about AST changes or common obfuscations like string chunking.
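
A very small sketch of the "record the calls" idea follows. It assumes the simplest possible model: `eval` and `exec` are swapped for recording stubs in the builtins seen by the code under test. `run_with_recording` is a hypothetical helper, and this is not a sandbox: the code still runs in the real interpreter, and anything that re-imports `builtins` reaches the real functions.

```python
import builtins


def run_with_recording(source):
    """Execute `source` with eval/exec replaced by stubs; return recorded calls.

    Illustration only: this does NOT isolate untrusted code.
    """
    calls = []

    def make_stub(name):
        def stub(*args, **kwargs):
            # Record the call instead of performing it.
            calls.append((name, args))
            return None
        return stub

    # Copy the real builtins, then override the functions we want to model.
    fake_builtins = dict(vars(builtins))
    for name in ("eval", "exec"):
        fake_builtins[name] = make_stub(name)

    exec(source, {"__builtins__": fake_builtins})
    return calls


# String chunking hides the name from text matching, but the stub still sees the call.
recorded = run_with_recording('__builtins__["ev" + "al"]("malicious_code()")')
print(recorded)
```

Because the call is observed at runtime, the way the name was assembled no longer matters, which is the property described above.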

ewjoachim · 2020-04-05T22:33:41Z

> Another (more complex) option here is a mocked Python interpreter with models for potentially malicious functions. Running setup.py under this would allow us to record the calls without needing to worry about AST changes or common obfuscations like string chunking.

Hm, yes. However creative we get with AST, I agree with you that we'll never be smart enough to detect code that does nasty stuff without executing it.

Your link, @xmunoz, was very interesting, but the researcher decided to go with static analysis, and, well, is it worth building something that will quite easily produce false positives, and probably false negatives too?

Just re-reading the current rules, I realized they still don't take care of:

import inspect
inspect.getmodule([].__class__).__getattribute__("lave"[::-1])("print('malicious')")

(which took me only a few minutes to write). This really is a Pandora's box.
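
For readers unfamiliar with the trick, the chain above can be unpacked step by step. The variable names are illustrative only.

```python
import builtins
import inspect

# list is defined in the builtins module, so this returns that module.
module = inspect.getmodule([].__class__)

# The reversed literal never spells the function name in the source text.
name = "lave"[::-1]

# Attribute lookup on the module yields the real builtin function.
resolved = module.__getattribute__(name)

print(module is builtins, name, resolved is eval)
```

None of the three steps mentions the target function literally, which is why name-based rules miss it.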

Sadly, while I'd love to learn more, I currently have no experience whatsoever with sandboxing Python :( Time to learn! And, as you suggest, continue the discussion in a dedicated ticket :)

ewjoachim · 2020-04-05T22:55:02Z

BTW, do you want me to add `__getattr__`, `__getattribute__` and `import inspect` to the current PR, or are we declaring it a lost cause for now?

ewjoachim · 2020-04-05T23:10:10Z

Oh gosh, I've just discovered the documentation, and it looks like I didn't increment the version number. Doing this now! https://warehouse.pypa.io/development/malware-checks/

xmunoz · 2020-04-08T15:41:58Z

> BTW, do you want me to add `__getattr__`, `__getattribute__` and `import inspect` to the current PR, or are we declaring it a lost cause for now?

If you want, but I'm not going to push for it. If you're happy with the PR, I'll re-assign the reviewer to someone that can merge it.

ewjoachim · 2020-04-09T16:55:43Z

I think I'm happy with it. There will always be time to add stuff later.

ewjoachim force-pushed the patch-1 branch from f0c7ff9 to 24c6b61 Compare February 23, 2020 01:43

di requested a review from xmunoz April 4, 2020 20:39

ewjoachim force-pushed the patch-1 branch from 24c6b61 to 7ac4646 Compare April 5, 2020 08:28

xmunoz approved these changes Apr 5, 2020

ewjoachim force-pushed the patch-1 branch from 7ac4646 to 48e6b30 Compare April 5, 2020 22:36

Added tests for yara rules

9020075

ewjoachim force-pushed the patch-1 branch from 48e6b30 to 9020075 Compare April 5, 2020 22:53

Update version on YARA malware checker

725cf1c

ewjoachim mentioned this pull request Apr 5, 2020

Implement a more robust malware detector #7748

Closed

xmunoz requested a review from ewdurbin April 9, 2020 17:02

Merge branch 'master' into patch-1

85e6540

ewdurbin merged commit c488578 into pypi:master Apr 10, 2020

ewjoachim deleted the patch-1 branch April 10, 2020 17:32

MaxMood96 mentioned this pull request Dec 25, 2023

[Snyk] Security upgrade stylelint from 14.16.1 to 16.1.0 MaxMood96/warehouse#2393

Open
