Monarch geneset OGS2.0

DPOGS203097
Transcript: DPOGS203097-TA, 3834 bp
Protein: DPOGS203097-PA, 1277 aa
Genomic position: DPSCF300391 - 4485-14509
RNAseq coverage: 164x (Rank: top 51%)
Annotation
Source           Hit                      E-value  Percent  Description
Heliconius       HMEL011569               0.0      71.08%
Bombyx           BGIBMGA011154-TA         4e-78    39.10%
Drosophila       gek-PA                   2e-33    70.00%
EBI UniRef50     UniRef50_UPI00015B43DF   7e-67    25.95%   UPI00015B43DF related cluster n=1 Tax=unknown RepID=UPI00015B43DF
NCBI RefSeq      XP_001601592.1           1e-67    25.95%   PREDICTED: similar to congenital dyserythropoietic anemia type I (human) [Nasonia vitripennis]
NCBI nr blastp   gi|156539949             3e-66    25.95%   PREDICTED: codanin-1-like [Nasonia vitripennis]
NCBI nr blastx   gi|156539949             5e-70    23.92%   PREDICTED: codanin-1-like [Nasonia vitripennis]
Group
KEGG pathway:
Orthology group: MCL25816, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203097-TA
ATGCCTGAATCAATAATAGAGAGTGTCTTATCTGGTAAATTACAACGTGATTTATTGTTCAAATGGTTGAACGACGAGCTAGTTGAGACAGCGCCAGATGATTGTATATTGCTATGTTCTAATAGGAACGAATTCGTTTCCTATTTTCTCTCTTACCTCAGAGACCACACTGAGAGTATTCTACAGACAAACACAAACTCGTTGCTACCTCCATTCCAAGGGACTCCGGAGAAAAATAGGAAACATCATCGTTCTATCAGCGATCCGACCTGTGACAATGAAAAATCAAATGACAGCGCACATAAATCAGAAAGAAAGACAGAAGAGTCACCGAACAGAGATAAAAAGAGAAACAGAAGAGTTAAAACGAAACTCTTCTCTGACGACAGAAGCAAAGACAATCTGAATTCATCGGACGAATTACGAATAAGCACTGGAATTGATAGACTGATGTTATCTAGTACCCCGATGAAAAATGGCTTAAGGGACAGCCCAAACTCACCAGCCAGTCCTGTGACGCCGCAGTCGAGGTCGTTTAGGGAATATGAAAGGTGTGACACCCCTAGGTCTGGAAGGCACTCAAGATCTCAAGACAAAAACGTCAGCCTAGCCGATTACCTAGTTAATATCCAACCGAAAAGCACGAAGAAGAAACGATCAAAGAACACGTCCAATGACGACAGCGATCCTAAATTGGATCTGGATTTAAGCAACTCGGAAATGTTCCCTGAGATAGGTGCGAGGAAATCAAGCTCGCTGAAATCTGAAAGGCGTAGGATTAAACCGACGAACATAGATAAGAACAAGAAGAGTTTCTCGCTCAACAGCTTCAACGCTGAGAACTTCCAGCAGCCCTCGCCATTGGCGCTGGAAGAGAACCTGGCTTTCAAACCCAAGTTACAGCCAAAGGAATCATCAAACTCCTTCGATGCTGAGAGGAATATACTTAAGCAGGAGAGGCAGAAGTTAATGGAGAAATTCAATGTTCTGAACACAACCACAACTCCCATAACACCTTCCCAGGTTAAGATACTGCGGAAAGAAGAATCCAGTACAGCATACATCGAGGCTGACAGCACTAAGCTAACCTTCAAAGAAAAAATAGACACATTAGTGGACATATACGATGTTTTATTCAAAAATAATCTAATACTATGCATAAATACGGAAATATATTTCTTAATAACAATTCTTCTGTCAAAACAATGCGAAGACGACATCAAAACAACAGAACATCTGCTAGAAACAGACTTGGCTAACAACATTTTGAAGCCCATACATAACAGCACATATTTCGCTGTGAAATCCCTGTGGAATCAGCGTATAATCCTGGAAGTGATCCTAGACAAGAACTCCCTGAAGATTCTGGGGGAGAATAAGAAAGTGCGCAGCTTCTCGCCAGATTTAGCCAAATTCCTGCTGAATTCGTACGGCCTGAAATGCGAGGCTGAGTCCCAAGATAGGTCGAAAACTGTGACCCAGAACAGATGCTCCAACGGGATCATATGTTTCAATCACGAGACGGATAACGCTGAGAACTTCCCGTCCATATTGAGCTTCCAGAATTTCAAAAAACAGAGGGACATGTTTTACGAAATATTAAGGTGGTATCAGGACACTCAAAGCACTGGTGTATCTAGATCGACGATGCGCGCTCGGACACGCGCTCTGATATCAGCTGGTCCAAGCGCCGCTAACCACGCTCACCTCTCAACACTCATCACACACATGCTGGTGGACACGGTGCCTGACAACGAACAGGAATCGAAGTTAAGTAAGCTTCAGCGGCGTCTGACATGCTCTTCATCATCAGACTCTAATTCACTGCCAAGATTCACAGACAGGGAGATGTTTTACAAGTTAGTATTGAAAAAGGGTGTCATAGACGGGCACACACATGCACACGCACATACACACACACACACACACACACATACAAACATATACATACATACATATATATATGTGCCTAAGCCCGGCGGGGTCAAGAAGGGTTGGAT
GCGCCAGTTCGTAGTGGTCTGTGACTTCAAACTATTCCTCTACGACATATCTCAGGACAGGAACGCGGTGCCCTCAGTGTGTGTGAGCCAGGTCCTCGACATGCGCGACCCTGAGTTCAGCGTGACCTCCGTCAAAGATTCGGATGTCATACACGCCAGTAAACGAGACATACCCTGCATATTTAGGAGCCAGCCGTCTATGGATATCAAAGGTTTGCTGCTGAACGCCAATCAAAATGGCCGTCTGACTATCACGATACCCTGGATAGTGCACTACTTATCGATGTTGGACTACACTACCCTAAGACTGAAGTACTATCAAAATATATTAAAACTACTTTTCGATATACATAGGAAGTTAAATATAACAGCATTCAAGAAGAACACTGTGATATTCCTTAAGCTCAACTTAGGCTGGTTGTTCGATCTGCCCCATGTACCACAAGAGGTGTTTTACGCTAAGAATGACACTAGTGTTGGTGTTGCCAGTGTAATTGATTGTGAATTTGATATCGAGGAACATGTGTTGCAAGAACTCTGCCCCTACTTGAAGGATCTCAGTGTATTTTTAAGTACATCCCGAGTTAGCCAGGATTCGAACCAGTTTGGCAGCTTCAGACATATAACACCTGTGAGTCTCGGTTTAAACAATGAGGATAAGATCCGGAACAAGGAGAAGGAACTGCAGATGCGTTTGGAAGAGGAGCTTATCAAAAGCCAGCCATCTTCAACTCGGAGAGTGTTAGAGCTGGTGACAGAGAGGGTTACATCAGCAGCGATCAAGGAGCTCACTACAAATACCCTCGTGGAAGCCAGGAAGAAATCCAGGGCGGGCGCTGCGGCTATAGTGGCTGGGTGCAGAGACAGGTCGGGTCTACTCTCGTCTCTACAGAGCCACTATTCCGCCCAGTTATGCTCTCTCAATGCCAGCGCGCTCTCTTCCTCCCGGACCTTGATCAAGGCTCGAGTGACGTCAGCACTGGCTGCCCTGCTGCCGACCGCCCCGCCCCCCCTCAGGGCGGTGGTGGGGGGAGCCTGCTGCCAGCGGGTGGACAAGTGGCTGGCCCAGCATTGGACCAGCACTGATATCCTGTGTAAGGACATCAAGAGCGAGATGGAGAACCTGTGCAGTGTGGGCCCCGGGGTCTCCGTGGCTGGGGAGGGCGTTGACCTGGTGACCTGCAACCTGGACGAAATGATGAGTCCAGCGCAGGCTATACTCGACCTTAAGGAACAGATCTGCGTGACCCTGGAAGGCGGCGCCCCCTGCCTGCAGGTGCTCCGCTCGTGTACCTTGGCCTGCGACCCCAACAACGTGTTCTGTCGCGCCCCAACCCTCACCGCGATACTACACCTGTCCGTGGATTTCTGTGTTGTTTACGTGAGCAGACACGCGTCCAAGGTTTTGAATATTCTGCCAGAGTTCAAGGCCCTTTGGGAACAGTGCTGCCCTCACAGGAGACGGGAAGAGCGAGAGGAAGACGATAGCAGGGAGAGGGCACCAGATCTCACAAAGCATGAGGACTGTTACTTCGACAGGATTCTGTGTCCCAGGAACATTATGTTGTTGAGCGAGACAAGATCCGGTGACGTGTGGCCGGCCATGGCTGACGTGTTAGTTTATTTGTTGAAGCATAACTACTTAACCGAGGACAGCCTGACCGAACAGTGCCTGGCTGTTTACAAACAGGATTGGCCCCAGAACATTCTTGAGAATCTATCGACATGTATGAGAAAGGTGTCCTCCAAGTGGCCGTCAACGGGCAAGTTTACTTTGTTCCTTGATTTCTTAGCAGACTTCTGTAACGATATGGACTATGACTTAATAGAGTAA

Protein sequence:

>DPOGS203097-PA
MPESIIESVLSGKLQRDLLFKWLNDELVETAPDDCILLCSNRNEFVSYFLSYLRDHTESILQTNTNSLLPPFQGTPEKNRKHHRSISDPTCDNEKSNDSAHKSERKTEESPNRDKKRNRRVKTKLFSDDRSKDNLNSSDELRISTGIDRLMLSSTPMKNGLRDSPNSPASPVTPQSRSFREYERCDTPRSGRHSRSQDKNVSLADYLVNIQPKSTKKKRSKNTSNDDSDPKLDLDLSNSEMFPEIGARKSSSLKSERRRIKPTNIDKNKKSFSLNSFNAENFQQPSPLALEENLAFKPKLQPKESSNSFDAERNILKQERQKLMEKFNVLNTTTTPITPSQVKILRKEESSTAYIEADSTKLTFKEKIDTLVDIYDVLFKNNLILCINTEIYFLITILLSKQCEDDIKTTEHLLETDLANNILKPIHNSTYFAVKSLWNQRIILEVILDKNSLKILGENKKVRSFSPDLAKFLLNSYGLKCEAESQDRSKTVTQNRCSNGIICFNHETDNAENFPSILSFQNFKKQRDMFYEILRWYQDTQSTGVSRSTMRARTRALISAGPSAANHAHLSTLITHMLVDTVPDNEQESKLSKLQRRLTCSSSSDSNSLPRFTDREMFYKLVLKKGVIDGHTHAHAHTHTHTHTHTNIYIHTYIYVPKPGGVKKGWMRQFVVVCDFKLFLYDISQDRNAVPSVCVSQVLDMRDPEFSVTSVKDSDVIHASKRDIPCIFRSQPSMDIKGLLLNANQNGRLTITIPWIVHYLSMLDYTTLRLKYYQNILKLLFDIHRKLNITAFKKNTVIFLKLNLGWLFDLPHVPQEVFYAKNDTSVGVASVIDCEFDIEEHVLQELCPYLKDLSVFLSTSRVSQDSNQFGSFRHITPVSLGLNNEDKIRNKEKELQMRLEEELIKSQPSSTRRVLELVTERVTSAAIKELTTNTLVEARKKSRAGAAAIVAGCRDRSGLLSSLQSHYSAQLCSLNASALSSSRTLIKARVTSALAALLPTAPPPLRAVVGGACCQRVDKWLAQHWTSTDILCKDIKSEMENLCSVGPGVSVAGEGVDLVTCNLDEMMSPAQAILDLKEQICVTLEGGAPCLQVLRSCTLACDPNNVFCRAPTLTAILHLSVDFCVVYVSRHASKVLNILPEFKALWEQCCPHRRREEREEDDSRERAPDLTKHEDCYFDRILCPRNIMLLSETRSGDVWPAMADVLVYLLKHNYLTEDSLTEQCLAVYKQDWPQNILENLSTCMRKVSSKWPSTGKFTLFLDFLADFCNDMDYDLIE-
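
The transcript and protein lengths listed above can be cross-checked: a coding transcript should carry one codon per residue plus one stop codon, so 3834 bp corresponds to 1277 aa. The following is a minimal sketch of that check in Python, assuming the trailing "-" in the protein record marks the stop codon. The helper names (parse_fasta, lengths_consistent) and the toy records are hypothetical and are not part of the database.

```python
def parse_fasta(text):
    """Return {header: sequence} from FASTA-formatted text.

    Sequence lines that belong to one record are joined, so a
    sequence wrapped over several lines is read as a single string.
    """
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:]
            records[name] = ""
        elif name is not None:
            records[name] += line
    return records


def lengths_consistent(transcript, protein):
    """True if the transcript has one codon per residue plus a stop codon."""
    # Assumption: a trailing "-" or "*" in the protein is a stop marker.
    residues = protein.rstrip("-*")
    return len(transcript) == 3 * (len(residues) + 1)


# Toy records for illustration only (not the real sequences).
toy = """>toy-TA
ATGCCTGAA
TCATAA
>toy-PA
MPES-
"""

records = parse_fasta(toy)
print(lengths_consistent(records["toy-TA"], records["toy-PA"]))
```

To apply the same check to this entry, paste the two FASTA records into one string and pass records["DPOGS203097-TA"] and records["DPOGS203097-PA"] to lengths_consistent.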