Source code embeddings
  ◦ explicit features (software metrics, simple NLP features, path-based representations, ...)
  ◦ implicit features (N-grams, AST encodings, feature hashing, autoencoders, GNNs, ...)
  ◦ on different levels
    ▪ tokens, methods, API calls, system events, execution traces, code changes, ...
• IR, binary code
  ◦ paths in CFG, NLP features, bitmaps, ...
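To make the token-level case concrete, here is a minimal sketch that combines two of the listed techniques, N-grams and feature hashing, to turn a code snippet into a fixed-size vector. Everything in it is an illustrative assumption, not something the slide prescribes: the regex tokenizer, the function names (`tokenize`, `ngrams`, `hashed_embedding`, `cosine`), the use of MD5 as the hash, and the defaults `n=2` and `dim=64`.

```python
import hashlib
import re


def tokenize(code: str) -> list[str]:
    """Crude lexer (assumption): identifiers, integers, or single symbols."""
    return re.findall(r"[A-Za-z_]\w*|\d+|\S", code)


def ngrams(tokens: list[str], n: int = 2) -> list[tuple[str, ...]]:
    """All contiguous token windows of length n."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def hashed_embedding(code: str, n: int = 2, dim: int = 64) -> list[float]:
    """Map token n-grams into a dim-sized vector via the hashing trick."""
    vec = [0.0] * dim
    for gram in ngrams(tokenize(code), n):
        digest = hashlib.md5(" ".join(gram).encode("utf-8")).digest()
        # First four bytes pick the bucket; one bit of the fifth byte picks a
        # sign, so colliding n-grams tend to cancel instead of accumulating.
        bucket = int.from_bytes(digest[:4], "little") % dim
        sign = 1.0 if digest[4] & 1 else -1.0
        vec[bucket] += sign
    return vec


def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity; defined as 0.0 when either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = sum(x * x for x in a) ** 0.5
    norm_b = sum(y * y for y in b) ** 0.5
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


if __name__ == "__main__":
    v1 = hashed_embedding("total = total + item.price")
    v2 = hashed_embedding("total = total + item.cost")
    print(len(v1), cosine(v1, v2))
```

The vector length is fixed by `dim` regardless of vocabulary size, which is the main appeal of hashing over an explicit N-gram dictionary. The cost is that distinct N-grams can share a bucket. The same idea carries over to the other levels on the slide by swapping the tokenizer for a sequence of API calls, AST node types, or IR instructions.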