Slide 36
8th place
• The 6th-place finisher in the 2020 birdcall competition, again using SED. (dealing with the weak label problem)
• Train on 5- or 20-second segments and run inference on 40-second segments (the longer, the better).
Inference is also done with overlapping windows, e.g. 0-40 seconds followed by 20-60 seconds.
(incorporating information from the preceding and following segments)
• Augmentation: Gaussian noise, pink noise, volume adjustment, pitch shift
(Mixup also worked well, but could not be included in the final submission due to computational resource issues.)
• The loss function is distinctive. (BCEFocal2WayLoss)
• Primary labels and secondary labels are treated the same way.
• Pseudo-labeling (dealing with the noisy label problem)
• Two thresholds are used, a call threshold and a nocall threshold: bird species that exceed the call threshold are considered
positive, and segments in which no bird species exceeds the nocall threshold are also given "nocall."
(Bird labels and "nocall" can coexist.)
• Exclude bird species that should not exist based on regional information, even if they are predicted.
(metadata incorporation)
• Calculate the F1 score separately for bird-call rows and nocall rows, and derive the CV score as 0.54 * nocall_f1 + 0.46 * call_f1
(robust CV strategy)
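The overlapping inference windows mentioned above (0-40 seconds, then 20-60 seconds) can be sketched roughly as follows. The function name `overlapping_windows`, the whole-second arithmetic, and the way the tail of the clip is handled are assumptions made for illustration, not details from the original solution.

```python
def overlapping_windows(clip_seconds, window=40, hop=20):
    """Return (start, end) pairs in seconds covering a clip with overlap.

    window=40 and hop=20 reproduce the 0-40 / 20-60 pattern from the slide.
    Assumes whole-second clip lengths; tail handling is an illustrative choice.
    """
    if clip_seconds <= window:
        # Clip is shorter than one window: use it as a single segment.
        return [(0, clip_seconds)]
    starts = list(range(0, clip_seconds - window + 1, hop))
    # Add a final window aligned to the end if the tail is not yet covered.
    if starts[-1] + window < clip_seconds:
        starts.append(clip_seconds - window)
    return [(s, s + window) for s in starts]


if __name__ == "__main__":
    print(overlapping_windows(60))
```

With a hop of half the window, every moment of the clip (apart from the first and last 20 seconds) is seen by two windows, which is one way the predictions for a segment can draw on what comes before and after it.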
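The two-threshold rule and the regional filter described above might look something like the sketch below. The function name `decode_segment`, the threshold values, and the species codes are hypothetical and chosen only for illustration; the slide does not give the actual values. Note that a bird label and "nocall" can only coexist when the call threshold is lower than the nocall threshold, which this sketch assumes.

```python
def decode_segment(probs, call_threshold, nocall_threshold, allowed_species=None):
    """Turn per-species probabilities for one segment into a label list.

    probs: dict mapping species code -> predicted probability.
    allowed_species: optional set of species plausible for the recording
    region; anything outside it is dropped even if predicted.
    """
    if allowed_species is not None:
        probs = {s: p for s, p in probs.items() if s in allowed_species}
    # Species above the call threshold are positive.
    labels = sorted(s for s, p in probs.items() if p > call_threshold)
    # If no species exceeds the nocall threshold, also emit "nocall".
    if all(p <= nocall_threshold for p in probs.values()):
        labels.append("nocall")
    return labels


if __name__ == "__main__":
    # Hypothetical thresholds and species codes, for illustration only.
    print(decode_segment({"amerob": 0.4, "norcar": 0.1}, 0.3, 0.5))
```

In this example the species at 0.4 clears the call threshold of 0.3 but not the nocall threshold of 0.5, so the segment receives both the bird label and "nocall".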
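The weighted CV score, 0.54 * nocall_f1 + 0.46 * call_f1, can be illustrated with the sketch below. The slide does not say how each F1 is computed, so this assumes a per-row F1 between the true and predicted label sets, averaged within each group; the helper names `row_f1` and `weighted_cv` and the sample rows are hypothetical.

```python
def row_f1(true_labels, pred_labels):
    """F1 between the true and predicted label sets of a single row."""
    true_set, pred_set = set(true_labels), set(pred_labels)
    tp = len(true_set & pred_set)
    if tp == 0:
        return 0.0
    precision = tp / len(pred_set)
    recall = tp / len(true_set)
    return 2 * precision * recall / (precision + recall)


def _mean(values):
    # An empty group scores 0.0 here; this is an illustrative choice.
    return sum(values) / len(values) if values else 0.0


def weighted_cv(rows, w_nocall=0.54, w_call=0.46):
    """rows: list of (true_labels, pred_labels) pairs, one per segment."""
    nocall_f1 = _mean([row_f1(t, p) for t, p in rows if t == ["nocall"]])
    call_f1 = _mean([row_f1(t, p) for t, p in rows if t != ["nocall"]])
    return w_nocall * nocall_f1 + w_call * call_f1


if __name__ == "__main__":
    rows = [
        (["nocall"], ["nocall"]),
        (["nocall"], ["amerob"]),
        (["amerob"], ["amerob"]),
        (["amerob", "norcar"], ["amerob"]),
    ]
    print(round(weighted_cv(rows), 4))
```

Scoring the two groups separately keeps the CV estimate from shifting when the share of nocall rows in the validation data differs from the fixed 0.54 / 0.46 weighting.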