Monarch geneset OGS2.0

DPOGS210872
TranscriptDPOGS210872-TA2499 bp
ProteinDPOGS210872-PA832 aa
Genomic positionDPSCF300027 + 1301898-1308012
RNAseq coverage231x (Rank: top 44%)
Annotation
HeliconiusHMEL0222507e-9840.90% 
BombyxBGIBMGA006997-TA3e-14335.61% 
DrosophilaCG5626-PA2e-1733.14% 
EBI UniRef50UniRef50_UPI00022C97AC3e-2639.39%UPI00022C97AC related cluster n=1 Tax=unknown RepID=UPI00022C97AC
NCBI RefSeqXP_001121123.12e-2338.18%PREDICTED: similar to tRNA-splicing endonuclease subunit Sen54 (tRNA-intron endonuclease Sen54) [Apis mellifera]
NCBI nr blastpgi|3504121981e-2539.39%PREDICTED: hypothetical protein LOC100750076 [Bombus impatiens]
NCBI nr blastxgi|3504121982e-2437.63%PREDICTED: hypothetical protein LOC100750076 [Bombus impatiens]
Group
KEGG pathway 
Orthology groupMCL21941 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210872-TA
ATGAATATGCTAACTGGTTGTGAGTTAGTTCAGAGAGGTACTACAAAAATTGACGCCACATTACCTGAGCTGGGTGTCAAAGAGGTGACTCCAAATGGAACTTGGTTGGAGCAAAAACAGATACAGGCAGCTATAGAAGCTAGAAAAAAATTGATCGAAATACAACGTATCGAAAAAAAAGGAGCCATGAGTCATGCGGTATGGAAAGATAATTTAAAGCTTGCCGAAGTTACCCATAAAGTGGGTGGTCATTGGCAACACATGGGACACAACATAGGAAAGCAACTTTATATTAGAGCCGAAGAAATTTTATTTTTGATGGAACTTAATTGTCTATATTTCAAATTCAATGATGTGGTTGTGTCTCTACAGCAGGCATACTCTTTGTTTCTTGGACCACTAATATCACATCAACAGTACAAAGTTTATGCGTCACTCAGCCGTCTCGGATACCGTGTGTATAGGCATGAAACATCAGACAAAGAGATACATAATCCCAGTACTAGTAATAATGTTAACAAAGTCAATATTGGTAGCTCGGAAGCGAGTTGTTCGGTTGGGAATCCAACTATGGTAAACATTAAATTGGAAAGAATGGATGAATATGAAAATATACAAATCAAACAAGAACCGGACACAGAAGAATATAACGCAGGTGACGTAAACCATGGTATATTAAGCGTACCTATAAAGAAGGAATTTCCACAATCAGATAGTGATATGGAAACGAGCGCTTCAAACTCAATACAGGGCAGTAGCCAAGATAAATCAGATAATATGTGCTGCAAAATAAAACTTAAGAATTTGAAAAGCAGAAAGCTTAAACCGTGCAGCGGAAAAGTTTTACATAAGTATTTTAATAACCTCCCCGAGCTTATAGGAACATCAACGGTTATCGTTAAGAAACCAAATATTATGTATTTACCTATAAACGTTATATTGAAGAGAGTAGAGTATACTGTCAATTTATTAAACATAAGGGAAAAGGGCGAGAGGACGGACGTGACGGAAACAAATATATACAATGAAGGGGATGAGGTGAATGGAAGTCATGTGAGAAGGTTGAGGAGCTCAGCGAGCAGATCCGATACTGATGGTACGGGGCAATCCAATATGAGGTTCCCGTCACAGAACCCACAATACCGACCATACAACATGTGGAGAGACCGTCATAGCTTCAACTATTTCAACTTCAACATGTTCTTCCAAAGAAACTTCTCCCCGCCCTGGTACACAAACAGCAATTGTCAATTCACACCAAGAAACCACGCCTACCCTCGCCCGACACTGAGCGTACACAACCGCATCACGCCCCAAACCAGGAAGAGACCAAAAAATAACACACAAAAACAACATTTAGATGGTATTAAAAAACTTAGTGCCAAATTAAAGTTACTACTACAAAAAGGAAACATGGACCCAGTGAATATACAGTCGCTGCAAAGACTTATTCATCTTTACAACAGGCGTTACAAGGCAAGAATGAGGATAAGTCCGGACTTTGATATCGTCGACGAAAACGTATTGGACACCATCGACCTAGACGATGAGGAGCCGGCGGAGAAGAGGCGCAGAAACAATGACGACCAAGCGTACAAAGAAAATCTGGAACAGCTAAAACAACTGGCATCCAAACTAAAGAGTTTAGAGGAAAACAAGAAGTCATCTGCCAGACACCGAAGGGCTTTATCAAAGCTATTGAAAGTGTTCAACGAGTCTTATAAAGAGGAATATTATCTATCGGAAGAGCACGAAATCAAAAATCCAAGACATATAACTCTTGATACGTCAGAATCGGAATCGGACGTATTAATGAACGAAGACACTCCGAATCGCTCCAAAGGGAAAAAGGTCAAGAATCCGTTCAACATATTGAAGAGACTCACGGAGAAACAGAAAGATGGGAATACATCATCCACGAGCGACGATTGCGATGTACAGGAGTGTAAGAAGTATAGTGACGTGCTAGCCAAGACGTTTTCTAAAGGTTGGCTGCCGAGGGAGGATGATTTTGGTAGAGCCGAAGTTGTTAAGAAAGATTCCATAAACATGGCTTGTAATCACGACGAAAAAGATCTGAGGAGAGAGGAATTCATGTATGACTTCCTTAAGATACAATCTACCAAAACCGACGACTGGTTGAATCTCAAAAATGAGACAATCAAGTCCGAAGATAATTCCTTGGCATCTGTCTTAAGGAAACTGACTATCATCAGAAGAGATGACAGTCTCAATGATGAATGCAATCTTAAGATTGATTTCGATGTTTACAACCGAGATGTTCAAAACTTCAGGAAGACCAATCGCCCAACACCACATTTTCGTTTAATTGCCCTCGATGAATCATCAACCATACCATCCGGACAGGAAATAGCTACACTGGCATCTAAATACAATGACGATGTTGCCATCATATTTGCTGTAGTGGGCATGAATTCCATCGGTTTCATACAAATCAAGCCGACGGTGTTGCCAGTGTTCCTTCCAAATGAATCTTAA

Protein sequence:

>DPOGS210872-PA
MNMLTGCELVQRGTTKIDATLPELGVKEVTPNGTWLEQKQIQAAIEARKKLIEIQRIEKKGAMSHAVWKDNLKLAEVTHKVGGHWQHMGHNIGKQLYIRAEEILFLMELNCLYFKFNDVVVSLQQAYSLFLGPLISHQQYKVYASLSRLGYRVYRHETSDKEIHNPSTSNNVNKVNIGSSEASCSVGNPTMVNIKLERMDEYENIQIKQEPDTEEYNAGDVNHGILSVPIKKEFPQSDSDMETSASNSIQGSSQDKSDNMCCKIKLKNLKSRKLKPCSGKVLHKYFNNLPELIGTSTVIVKKPNIMYLPINVILKRVEYTVNLLNIREKGERTDVTETNIYNEGDEVNGSHVRRLRSSASRSDTDGTGQSNMRFPSQNPQYRPYNMWRDRHSFNYFNFNMFFQRNFSPPWYTNSNCQFTPRNHAYPRPTLSVHNRITPQTRKRPKNNTQKQHLDGIKKLSAKLKLLLQKGNMDPVNIQSLQRLIHLYNRRYKARMRISPDFDIVDENVLDTIDLDDEEPAEKRRRNNDDQAYKENLEQLKQLASKLKSLEENKKSSARHRRALSKLLKVFNESYKEEYYLSEEHEIKNPRHITLDTSESESDVLMNEDTPNRSKGKKVKNPFNILKRLTEKQKDGNTSSTSDDCDVQECKKYSDVLAKTFSKGWLPREDDFGRAEVVKKDSINMACNHDEKDLRREEFMYDFLKIQSTKTDDWLNLKNETIKSEDNSLASVLRKLTIIRRDDSLNDECNLKIDFDVYNRDVQNFRKTNRPTPHFRLIALDESSTIPSGQEIATLASKYNDDVAIIFAVVGMNSIGFIQIKPTVLPVFLPNES-