Google • First technique to match SOTA performance in CIFAR-10 and PTB • Agent samples architectures from search space • Evaluates architectures by training them from scratch • Expensive more than 24000 GPU hours required to find the architectures
NAS • Uses weight sharing based performance estimation strategy for evaluation • Provides search space options as macro and micro • Comparable results to NAS with just 24 GPU hours
curve evaluation: Evaluating architectures sampled at initial epoch and final epoch of ENAS • Performance estimation strategy: Training 5 architectures sampled at epoch 155 from scratch
search space. • ENAS Search strategy is as good as random search. • ENAS performance estimation strategy is biased and overfit on the models with more shared weights. Future Work: • Applying same experimental methodology to DARTS, PENAS and NAO.
- Action space: Search space - Actions: Sampled architectures - Reward : input from performance estimation strategy For ENAS: At first step the controller receives an empty input.