Monarch geneset OGS2.0

DPOGS211800
TranscriptDPOGS211800-TA1527 bp
ProteinDPOGS211800-PA508 aa
Genomic positionDPSCF300031 - 1063947-1075842
RNAseq coverage266x (Rank: top 40%)
Annotation
HeliconiusHMEL0063203e-12683.66% 
BombyxBGIBMGA006123-TA1e-14878.48% 
Drosophila% 
EBI UniRef50UniRef50_G5BAX92e-5531.78%Endonuclease/exonuclease/phosphatase family domain-containing protein 1 n=28 Tax=Euteleostomi RepID=G5BAX9_HETGA
NCBI RefSeqXP_975030.29e-8539.72%PREDICTED: similar to importin alpha [Tribolium castaneum]
NCBI nr blastpgi|2700077675e-10041.37%hypothetical protein TcasGA2_TC014464 [Tribolium castaneum]
NCBI nr blastxgi|2700077671e-9741.18%hypothetical protein TcasGA2_TC014464 [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL19521 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211800-TA
ATGAACGTGAACTCCGCCACCGAGGAACAGCTGATGACATTGCCGGGGGTCTCTCGCCAGCTCGCGCGGGAAATTGTCCGACACAGACAAATGATTGGCAGATTCAAGCGCGTCGACGACCTTGCCTTAGTGTCAGGTATTGGAGCTGAAAAGCTTGAGTTGTTAAGACCAGAAATATGTACAAATTCCAAAAGAGAGATATCAAGGGCAAGTTCCTGTACTCATTCCTTAGACAGTGTAAGAATTACAAATGAAAATAAACTATGTTCTGTGAATTCATCCAGTGTATTCCAACTGCAATGTGTGCCGGGACTGAATCAAGAATTAGCTGCTAATATTGTAGATTATAGAAATAAGAAAGGACCATTCAAATCATTGGATGACTTAATAAAAGTCAGAGGCATAGATATTGTCAGGCTGAGTACTGTTAAACCACATTTAAATTTGGAATTACGCAAGAGTGAGAGTGTGCAACATTTAACTAACGGACATGTAAATGGTTGGAAAGAGACATCTCTCGATGACTCATATCTAAACAGGGAAACCAAATCACTAAGGAGTCCTCATAGAAAAAGTATGTCTATGCCTACAAAGTTCCCAATCACATTGCCAAATGGTTTTGCTACAGCGCCGGTGAATGATATATTAGATTTGCTATCAGCCTACTCTCACCGTCCCATTGTGGAGGAAGTCTTCAGATATGAGAGGGATGGAGTGAGATGCTGTCGTCTCGCATCTTGGAACCTCCATCAGCTCAGTGTCGATAAAGTCACAAATCCAGGTGTCAGAGAAGTGATATGCCGGACCATTTTAGAATATAGATTGTCAATTGTAGCTATACAGGATGTGTGTGAGGAGTCATCTCTACGTATGATATGTGAAGAATTGAACTCACCGGCTCTAAGGAGAGTGACTGAGTGGAGGTGGAATAATAGGTCTTGGAACTACTGCTTACCGAGTGATGGAAAAGGAAGCAGCCTCGGCTTCTTATACGAGAGATCCAACAAACACGTGTCCGTGGAGGAAGTGACGCGCGCGAAACGAGACGTCATCTCGGAACACGCTGCGAGGATCCTGGAGACAGTTGACAGTATTAAGAGTGACAGATTGCTACTATTCCCACAGGTGTTCCTACTAAATGACAGGCCATTAATAATGTTGAACGTCCAATGTAAGGACCGTCTGAGTGAAGAGGAGAGCAACAAACTGAGAGAGATAGCTGACATGGCGCTCACTTCAAAATTACAATTAGCTTTCTTCGGGGATTTTCTGAGTTGGAAAAATGTACAATGTTTACGTAACTGTGAATCAGTTTTGGACACGGCGATAGTGTCGTCCTTGGATCCGAGCGTCGCGGGTCAGTGTGCTATATTGTGTGTAGGGGAGGTCGAGGGCAGCTCCTTCAACGGACACGCGGGCGTCGTCAAGACAGGCCTCTGCCACCTGGCCATACCTCGCGGCTGGTCGTGGGGCGGACCCGCGTCTCCATTCTGTCCGATATGGGCCGAACTGAATGTACCCGATTGA

Protein sequence:

>DPOGS211800-PA
MNVNSATEEQLMTLPGVSRQLAREIVRHRQMIGRFKRVDDLALVSGIGAEKLELLRPEICTNSKREISRASSCTHSLDSVRITNENKLCSVNSSSVFQLQCVPGLNQELAANIVDYRNKKGPFKSLDDLIKVRGIDIVRLSTVKPHLNLELRKSESVQHLTNGHVNGWKETSLDDSYLNRETKSLRSPHRKSMSMPTKFPITLPNGFATAPVNDILDLLSAYSHRPIVEEVFRYERDGVRCCRLASWNLHQLSVDKVTNPGVREVICRTILEYRLSIVAIQDVCEESSLRMICEELNSPALRRVTEWRWNNRSWNYCLPSDGKGSSLGFLYERSNKHVSVEEVTRAKRDVISEHAARILETVDSIKSDRLLLFPQVFLLNDRPLIMLNVQCKDRLSEEESNKLREIADMALTSKLQLAFFGDFLSWKNVQCLRNCESVLDTAIVSSLDPSVAGQCAILCVGEVEGSSFNGHAGVVKTGLCHLAIPRGWSWGGPASPFCPIWAELNVPD-