Monarch geneset OGS2.0

DPOGS207358
Transcript: DPOGS207358-TA (1998 bp)
Protein: DPOGS207358-PA (665 aa)
Genomic position: DPSCF300188 + 376880-382671
RNAseq coverage: 849x (Rank: top 15%)
Annotation
Heliconius: HMEL008860; 0.0; 62.38%
Bombyx: BGIBMGA012704-TA; 4e-38; 80.61%
Drosophila: CG3558-PC; 3e-73; 30.65%
EBI UniRef50: UniRef50_E2B267; 6e-93; 33.12%; UPF0518 protein CG3558 n=9 Tax=Formicidae RepID=E2B267_CAMFO
NCBI RefSeq: XP_395745.3; 2e-93; 33.57%; PREDICTED: similar to CG3558-PA, isoform A [Apis mellifera]
NCBI nr blastp: gi|332024042; 2e-95; 33.51%; UPF0518 protein [Acromyrmex echinatior]
NCBI nr blastx: gi|195387329; 9e-77; 31.20%; GJ17503 [Drosophila virilis]
Group
KEGG pathway 
InterPro domain: [8-181] IPR019384; 1e-33; Retinoic acid induced 16-like protein
Orthology group: MCL11028; Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207358-TA
ATGTCAGCCAGCTGTCCGCCATTGGCTCATTGTCTCCTTGCTGGAAACGCGTGTCCTATACTTGCCACCGGTCTAAGCGGTCTCTATTCCTTACTGCCGAGGTCGCTGTGCGTCGCTCGACTGCTGCCAGATGACGTGAACAGAGTACACAAGCTCACATTGTTTATTGACTCGCTAGAGTTTTGCAATGCTGTGGCTCAGGTTGCGCACCCATCAATAAAGAGACACCTATTGGATTTGCTTTACCAAGGTTTCTTAGTGCCCGTCATGGGTCCCGCATTATTGCAGAGCGGTGCGGCGGAACAAGCGACGGCTATGGCCTATTTGGAGCTTTTACTGAGATGCGTCACCTCTGGTGGTCTGCTAAGGGTTATACTGCATTATTTATTCACTTACGAATACGACGGTGTCAGAGTCATGGATGTGCTGCTCGGACGGTTAGAGGGGAATTCGCAATTGTCGTTGGTGACGCTGTCATTAATGGAAACTCTGATAGATCTGAACACCGAAGACGTTATGATGGAGCTCGTATATAAATATCTACTGAGCGGGAACCACTTGTTGGCTGCGTACAGAAGCAAGCTCACCGACCCGGAGCCTTATAGAGAGGCTGCGGTAGCATTCCTCGGTCTTAGTCCGAATTGTTGTATGAGATCTGTACGTAAAACTAGGGAGTGGGTCGACGAATTGGCTATGAGCGGAAGGAGGATATTGGATTTTAAATGTGACAGTGACAACCGCAGAGTCAACGGCGTGCACGGCGACATCGCCATGAGCAACCTGGTCCAGGAGGTCAACTTTGACTACGGGAATCCAAACGAGACGCTCTTCAGCAACTACCACGCGTATCTCTGTGATGCTAGATTCGAAATAGTATCGAGGACGGTAGCGAGCGCTAACTGGACGAACAGATACGACAGTGTAGTCCCTAAAAGAGAAACAAATAATAATACGAAAGTAACGAAAGAGATGGTCGCGGGGACGCATATGAGGGAGCAGGACAGCCTCGTCTCACTGTCGGCCAGTGGCTACGACACCCTCAAGAACAGTGACACGTCCGAGAAGGAGGCTCACAGCCTGAGCTCACTGAACGAAGCGGTTGAGAACAGAAGCGATAACAATGAAACGAAAGACCAGGACAGCTTGACCTCCATGGGTGAGAGTAGCGGCTACGACAGCTTCCTGTATAGAGTGGACGATCAAGACGACGAAGGGTTCTCAGAGGATTCCTTCAAGAAGAGTATAATCAGTGTGAAACAGGAAGTTAGGGAGTACGAGGAGCCGGTGAAGAGCTGGAGGGTCAGCGCCGAAACGATTATCGGTCCTCTCCTAGCGAGTCTTCTCCGTAAGTCAGCCAGTTTCCTAGAATCCGAGCTGGCTGTCGCGTGTCGCGTGTCGAGTGTTCTAAGCAAGCTAGCCTATTTCCCAACACCGATACTGTCAGCGTTGCTGCTGTGTCCCGGATTCCTGTTACAACCAAACGTGCCATCGCTGTTCCAAATCCTGAGTCGTTTAAAACAAGAACTGGACGAGTTGACGGCTGGTCTCGATAACGTCGGCGAGTTGGTTGATAAGGCGCGAGTGTTTCTCATACACCGCGAGATGGCGCTGACGAGGCGGCCGCAGAAAAATGAAGGACAGTCGAAGTCAAATCCAGCCACACAGAGAGACGAGTCACCTTTCCGACGAGTCGAGGTCAAGAGACGGAGCATCTCGTCCAGCTTCTCCAGCATGTTCAACAGGAAGACGTCCTTATCACCATCACAACAAAACTCAAACAGCCTACAAAACACCAGCAACATCAGCCCAGCACAGAAGCGTCCGATATTCACACAGGAGGTGTACTGGAATCAATCCATAACGTTGAGATCAATCCTGAACGCTGTCATACTGGACGAGTGGTTGAAGGAACTGTCGGCCATATGCGAGGAACACGCTCTGCGTCTTACAGCCGACGTCTACGAAAACGACACGTACATACGATCCGTGGCATTATGA

Protein sequence:

>DPOGS207358-PA
MSASCPPLAHCLLAGNACPILATGLSGLYSLLPRSLCVARLLPDDVNRVHKLTLFIDSLEFCNAVAQVAHPSIKRHLLDLLYQGFLVPVMGPALLQSGAAEQATAMAYLELLLRCVTSGGLLRVILHYLFTYEYDGVRVMDVLLGRLEGNSQLSLVTLSLMETLIDLNTEDVMMELVYKYLLSGNHLLAAYRSKLTDPEPYREAAVAFLGLSPNCCMRSVRKTREWVDELAMSGRRILDFKCDSDNRRVNGVHGDIAMSNLVQEVNFDYGNPNETLFSNYHAYLCDARFEIVSRTVASANWTNRYDSVVPKRETNNNTKVTKEMVAGTHMREQDSLVSLSASGYDTLKNSDTSEKEAHSLSSLNEAVENRSDNNETKDQDSLTSMGESSGYDSFLYRVDDQDDEGFSEDSFKKSIISVKQEVREYEEPVKSWRVSAETIIGPLLASLLRKSASFLESELAVACRVSSVLSKLAYFPTPILSALLLCPGFLLQPNVPSLFQILSRLKQELDELTAGLDNVGELVDKARVFLIHREMALTRRPQKNEGQSKSNPATQRDESPFRRVEVKRRSISSSFSSMFNRKTSLSPSQQNSNSLQNTSNISPAQKRPIFTQEVYWNQSITLRSILNAVILDEWLKELSAICEEHALRLTADVYENDTYIRSVAL-
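
The lengths listed for this record are mutually consistent: 665 codons plus one stop codon give 3 x 666 = 1998 bp. As a minimal sketch of that check, assuming the standard genetic code and that the trailing "-" in the protein sequence marks the stop codon, the snippet below translates the first 30 and last 9 nucleotides of DPOGS207358-TA. The function name `translate` and the variable names are illustrative only and are not part of the database.

```python
# Standard genetic code, built in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(nucleotides: str) -> str:
    """Translate an in-frame DNA string; stop codons are shown as '*'."""
    seq = nucleotides.upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# Lengths as listed in the record.
transcript_bp = 1998
protein_aa = 665
lengths_consistent = transcript_bp == 3 * (protein_aa + 1)  # +1 for the stop codon

# First 30 and last 9 nucleotides copied from DPOGS207358-TA.
head = translate("ATGTCAGCCAGCTGTCCGCCATTGGCTCAT")
tail = translate("GCATTATGA")

print(lengths_consistent, head, tail)
```

The translated head and tail can be compared by eye with the start (MSASCPPLAH) and end (AL followed by the stop marker) of DPOGS207358-PA.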