Monarch geneset OGS2.0

DPOGS207874
TranscriptDPOGS207874-TA2022 bp
ProteinDPOGS207874-PA673 aa
Genomic positionDPSCF300101 - 169465-177807
RNAseq coverage39x (Rank: top 73%)
Annotation
HeliconiusHMEL0102360.074.56% 
BombyxBGIBMGA008349-TA2e-16569.19% 
DrosophilaCG3081-PA1e-2937.50% 
EBI UniRef50UniRef50_B4L7035e-4934.22%GI16070 n=2 Tax=Drosophila RepID=B4L703_DROMO
NCBI RefSeqXP_002011307.19e-5034.22%GI16070 [Drosophila mojavensis]
NCBI nr blastpgi|1951337602e-4834.22%GI16070 [Drosophila mojavensis]
NCBI nr blastxgi|2700056749e-5133.01%hypothetical protein TcasGA2_TC007771 [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL18260 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207874-TA
ATGGCTTACAATTTCGACAGCGGTCAATTTAATAAAGAGTCAACTCCGAAACGATTAGGAAACTGGCAGGTCCCACGTTGGGCGCCAACACGGGCACGCCCTATGGAAATGAGGCCTAACCCTAAACCCATTTGTGACCTCAACGGACATTTCTTACCCGGTGCACCTCGAGGATCATCTAGATGTTTCGGACACTTCACTGGAACTTGGGATTTACCGAGAAAAATAACCAGAAAAGTTGCCGAGGAACTCGCGAAGGAGCCTCCGGACAGCAGATTTCCAGATTGGATAAATCTTCGCGTCCAACGCCCGAAGGCTGCATCACGCATGCGTGGGGGAAAGCGCGGTAAAGAAAAGGAACAACAAGAGGAAGACGAGCAGAGTTTTACAGATAAGGATAAAGGTCCACAGAAACAAAATAACTGTTTAGAAACTAAGGCTGAAGAGAGTGAAACAATCAATGACCCTTTGGAATTGAATTCAGAAAAACAGCCGGTTGATAGTGACGGTCACGGCCCTTACATTCCCGGATCACAAACAGACGCCGACAAGCGGCATGATAGTTATGAAATTTTAGTTAAAGATGCTGTTGAAAAACTGCATGGACCACAGGAAGATCGAACAGAAAAATTTTGGAGGAAAGTAGCAGACAGTGCAAAAAAAGATGACATCGATGCGGAGAATATGAAATATTCCAAAAAGAGCGACACACCAAAGCTGCCGTACGTGAATGTTTCCCAACATTTCAATTTTGCAAAGAAAATGCACGCAGGTAACTTAGAACACGAACCTCTTCCAGATACTCTAGGGTATAGAAATCTGACTAAAGTAGGAGAGCACAGAAACTCCGATATCATTGTAAAACCTTATGCAATTGGTTGGAAAGGTTATGGAGCTTCAGGACCAACATGTTGTACAAAAATGCGAGTTCATCGACCAAAAACGTGTGATCCCAAGAAAAGTGATGAAAAAATAGATGAACGGAAAAATCGTCCTGCAACGTCCGATCTAGGGAGCCAGTATAAAAAACCGCTTTCCTTAATGGATCTCGCTATATGTTGGGACTTTAAACCTGATAATCCTCGAGTAGAACCTAAAGCGCCTAAGCATATAGATGGTTCTAATGGTTCAATGGCTCCAGCAGTCTTTACAATGGTGAACACACCAAAAGAAGAAGTAAAGGAAACTGTTTTAGGACACACATATCCAATTTTTAACCAAAATCGAGATCTAGGTGATATAGGTCGTCATAGTGTCCCGCATTTTGAAAAAGATATGACTAAAGAAAAAAGAAAAGAAGTTTGCGATGTACAGAACTGCCCACAGCATAATGATATTGAAAAAGGTATAAGTTCTCAGTCATCAAGTAGAAAAAGTTCGGCCTTTTCCGATCATAAAAGCCTATGTGATGGAATTAATGGTTGTGGCACTACCTCTAATGGTAGTCGTAATTCAAACCGCGAAAACACCAAAATAAAAGTAAACCATGCGAAAGATGTATGGAATGCTAATGACAAAAACAATTCAAAGTATGATATTAATCGCAATCGCCATAGTTTGGAGTCTTATGAGAAGACTCCACTGGCTCAAGGTAAATTACATCAGAGCTCACCTAACGCGTCACAAATATCTAAACAATCCTCCAAAGAGCCTACAAAATCTAGCGAATCTTTGGAAGGCAACGTCAAACATAAAAATTGCTTATCCTGCAATGGAGTAAAGATCCACCCGGAAAAAGTTAAGCAAAAAGATGATTATAAATTTGCCTTTAAGGCTGGAAATCCAAATTCCATCCAGAGCAATAGTACAAACGATTCAAAGGAATTGAAAATTCCGAAAATGCGTCACCCCTATACCAAAAAGTCTTACACGATTCCAACTTTAGCACCTCCATTTAGCATTTGGCGAGATGCAAACGCTACCGGTTATCCGGAACACTGGAGATTGGCAAGTGTTTACCAACATGCTTATAAACCTCCCGAACAAAGACGAAAACCATTGATAGAAAGCGTTTATCAGTAG

Protein sequence:

>DPOGS207874-PA
MAYNFDSGQFNKESTPKRLGNWQVPRWAPTRARPMEMRPNPKPICDLNGHFLPGAPRGSSRCFGHFTGTWDLPRKITRKVAEELAKEPPDSRFPDWINLRVQRPKAASRMRGGKRGKEKEQQEEDEQSFTDKDKGPQKQNNCLETKAEESETINDPLELNSEKQPVDSDGHGPYIPGSQTDADKRHDSYEILVKDAVEKLHGPQEDRTEKFWRKVADSAKKDDIDAENMKYSKKSDTPKLPYVNVSQHFNFAKKMHAGNLEHEPLPDTLGYRNLTKVGEHRNSDIIVKPYAIGWKGYGASGPTCCTKMRVHRPKTCDPKKSDEKIDERKNRPATSDLGSQYKKPLSLMDLAICWDFKPDNPRVEPKAPKHIDGSNGSMAPAVFTMVNTPKEEVKETVLGHTYPIFNQNRDLGDIGRHSVPHFEKDMTKEKRKEVCDVQNCPQHNDIEKGISSQSSSRKSSAFSDHKSLCDGINGCGTTSNGSRNSNRENTKIKVNHAKDVWNANDKNNSKYDINRNRHSLESYEKTPLAQGKLHQSSPNASQISKQSSKEPTKSSESLEGNVKHKNCLSCNGVKIHPEKVKQKDDYKFAFKAGNPNSIQSNSTNDSKELKIPKMRHPYTKKSYTIPTLAPPFSIWRDANATGYPEHWRLASVYQHAYKPPEQRRKPLIESVYQ-