Learning from Large-scale Mutagenesis Data