Monarch geneset OGS2.0

DPOGS209592
Transcript: DPOGS209592-TA | 1905 bp
Protein: DPOGS209592-PA | 634 aa
Genomic position: DPSCF300015 - 640795-643131
RNAseq coverage: 687x (Rank: top 19%)
Annotation
Heliconius: HMEL017014 | 0.0 | 80.29%
Bombyx: BGIBMGA006651-TA | 0.0 | 78.98%
Drosophila: osp-PF | 3e-50 | 45.74%
EBI UniRef50: UniRef50_E2A2N5 | 2e-101 | 42.22% | Protein outspread n=5 Tax=Formicidae RepID=E2A2N5_CAMFO
NCBI RefSeq: XP_001944135.1 | 8e-74 | 37.14% | PREDICTED: similar to outspread [Acyrthosiphon pisum]
NCBI nr blastp: gi|336088618 | 2e-105 | 42.08% | protein outspread [Apis mellifera]
NCBI nr blastx: gi|336088618 | 1e-113 | 41.79% | protein outspread [Apis mellifera]
Group
KEGG pathway 
Orthology group: MCL15834 | Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209592-TA
ATGACCAACAATAGAGAAGAAGACGATCCGAGTAAATCAGAAGTGATTATACAGAGTGCGGACCTGGAGGCCTTGAAAACGTCTCAGAAAGAGGTTGCGGAGGCTGTTGACAAATACAAGACAGAGAAGCTGGGTGAGCTGTGTTCTGCGCTCGCTAACGAAACATTACACTACGCGGCGGTTGGAGACGAGCCCGAGCGACAACGTGCCGGCCTAGAAGCGGCTCGGGTACGGGAGGCGTGGGCGGCAGCACGCGACGCCCTCGGAGCAGAATTGGTACAAGCTGAGGTAGCGCGTGCGCTAGGCCGAGCGGCGCAGCTCTGCGAGAGTCGGCTGCACGAGTCCAGGAATGCGCGACTCACACTAACAGCGCAACACCGGGCCGATTTGGAACTTTGGTGGCAGGCAGCTCACGACCACCTGCGCTGCGAGATGGACGCCGCCGTACGAGATATCGCCGCGCGCTACCGCGAGTGCCTCGCTGCCGATCGGCGCCGCGAGGGCTCCCCGGGACTCGACAGCCGCGCGCTGCTGCAGCAGCTGGCCGACGTCCTCGCTCAGAAGGCTCTTGTGGACGCCCGCGTCGCCGTCCTCGAAGGCACTTACCGCGCCTCCGAACCGTTCGACGCCGGTCGCCGTTCCGAGGACGCCATCTCCTCGTTGGAGACGGATCCCGCCACGGAGGCCGAGTTCGTCTACCTGTTTCGGCACTTCTCGACCGAGTGCCGGGCGCTCTTCTCCAGTGACTGTTCGGGCAACAGTGAAGACTCGATGAAGATCGGCGAGAGCCTCGAGCGCGCGGAGGCGGCGGTGGCGGCGGCGCAGCGCAGGCTGACGGCAGACCTGGGCGACGAGCGAGCGGTCGACGACGAGGGTTCGTCCGGCGGCGGGCTGGTGCAGCGCCTAGATGCCTTGAGGCGACGCGTGGAAGCTCTGCAGTGTCCGCAGTGCGCTCGTCTACAGGAGGCTCTGGAGCGGCTCTCTGCCGAACGGGCTCGCAGCGAATGCGCGCTGGCTCAGCAAGCTGCCGCACTAGCAGCAGCTCGCCGCGCTCGAGCTCAGCTACAGGCCCAACATGAAAGAGAGCGATCCGGACTACGAGAACGAGCCCGTACACTACAGCGCCGGCTGGCCGCGCTCGACTCTGAGTATTCGGCCCAGCTGGAGAGCCTCCGAGCGGCGTATCAGCAAACGACGCAGAGCGCGGTGTCCACGGACGCGCACGGGGACGGTCTGAGGGCACGCTACCAGCAAGAGATCGAGCAGCTGAGAGCGCTGTGCGAGAAGGGGCTCATGGCTATGGAAAGCAGCCACCGCCGCATCGTGAGGGAGATGGAAGACAAGCATCGCGCGGAAAGAGATCAGCTCAGGCTGGAGAAAGAGCAAGCGTTGGCTGAAGAGACTCGCGCCACGTTAGCTGCGCTGGACGCCATGCGCAAGGCACACGAGAGCGAAGTCCGCCGCGAGGTCGACAAGTTCAAGGCGGAGTTCCTCGCACGAGGCGCTCCCGACCTGGGACAGCTCAGCTCGAGGCACCAACAAGAGATGGAGGAAATAAAACGCGAGATCCTGTCTCTGTCCGAGAAGTACTCGGTGAAGTGTGTGGAATCCGCGGCGCTGGAGGAACGGCTGGCGGCGACCGCGGCCCAGCTCGCACACGCCCACCAACACATCATGCAGTTGGACGCGCGCAACAAACAACTTCGAGCTCACATCATGTCGGAGGCGAATAATGAGCTGAAGAACTCTGAGGCTTCCTCACTCGCGCAGCTGATAGAGGACGCTCCGGCGAAGTCATCTCGAACGAGTCCTATGACTTCCGGAGGAGTGTTCACCAGCTCGGACAGCGCCAGGAACATGACGCTGTCGCCGGTGCAGGTGACGAATATAAAACCGTCTCTTTAG

Protein sequence:

>DPOGS209592-PA
MTNNREEDDPSKSEVIIQSADLEALKTSQKEVAEAVDKYKTEKLGELCSALANETLHYAAVGDEPERQRAGLEAARVREAWAAARDALGAELVQAEVARALGRAAQLCESRLHESRNARLTLTAQHRADLELWWQAAHDHLRCEMDAAVRDIAARYRECLAADRRREGSPGLDSRALLQQLADVLAQKALVDARVAVLEGTYRASEPFDAGRRSEDAISSLETDPATEAEFVYLFRHFSTECRALFSSDCSGNSEDSMKIGESLERAEAAVAAAQRRLTADLGDERAVDDEGSSGGGLVQRLDALRRRVEALQCPQCARLQEALERLSAERARSECALAQQAAALAAARRARAQLQAQHERERSGLRERARTLQRRLAALDSEYSAQLESLRAAYQQTTQSAVSTDAHGDGLRARYQQEIEQLRALCEKGLMAMESSHRRIVREMEDKHRAERDQLRLEKEQALAEETRATLAALDAMRKAHESEVRREVDKFKAEFLARGAPDLGQLSSRHQQEMEEIKREILSLSEKYSVKCVESAALEERLAATAAQLAHAHQHIMQLDARNKQLRAHIMSEANNELKNSEASSLAQLIEDAPAKSSRTSPMTSGGVFTSSDSARNMTLSPVQVTNIKPSL-
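
The lengths in this record are consistent with one another: 1905 bp equals 3 x (634 + 1), which is 634 codons plus the terminal TAG stop codon, and the stop appears as the trailing "-" in the protein sequence. The sketch below is one way to check this. It assumes the standard genetic code and a transcript made up only of coding sequence. The names `translate` and `lengths_consistent` are hypothetical helpers written for this illustration and are not part of any database tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
# Stop codons are written as "-" to match the convention used in this record.
BASES = "TCAG"
AMINO = "FFLLSSSSYY--CC-WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; trailing partial codons are ignored."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def lengths_consistent(transcript_bp: int, protein_aa: int) -> bool:
    """True if the transcript length equals the protein's codons plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


# First ten codons and last five codons of DPOGS209592-TA, copied from the record.
print(translate("ATGACCAACAATAGAGAAGAAGACGATCCG"))  # MTNNREEDDP
print(translate("AAACCGTCTCTTTAG"))                 # KPSL-
print(lengths_consistent(1905, 634))                # True
```

Passing the full DPOGS209592-TA sequence to `translate` should reproduce the DPOGS209592-PA sequence, provided the assumptions above hold.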