Slide 12
Slide 12 text
Building vector representations
● Text
○ various NLP techniques
● Source code embeddings
○ explicit features (software metrics, simple NLP features, path-based representations, ...)
○ implicit features (N-grams, AST encodings, feature hashing, autoencoders, GNNs, ...)
○ on different levels
■ tokens, methods, API calls, system events, execution traces, code changes, ...
● IR, binary code
○ paths in CFG, NLP features, bitmaps, ...
12