Monarch geneset OGS2.0

DPOGS207491
TranscriptDPOGS207491-TA1524 bp
ProteinDPOGS207491-PA507 aa
Genomic positionDPSCF300051 + 718843-721730
RNAseq coverage97x (Rank: top 61%)
Annotation
HeliconiusHMEL0112631e-2924.66% 
BombyxBGIBMGA010587-TA2e-2234.39% 
Drosophila% 
EBI UniRef50UniRef50_UPI0001CB9F4E3e-6733.20%UPI0001CB9F4E related cluster n=3 Tax=unknown RepID=UPI0001CB9F4E
NCBI RefSeqXP_002733851.12e-6933.47%PREDICTED: RETRotransposon-like family member (retr-1)-like [Saccoglossus kowalevskii]
NCBI nr blastpgi|2912277615e-6833.47%PREDICTED: RETRotransposon-like family member (retr-1)-like [Saccoglossus kowalevskii]
NCBI nr blastxgi|3485436325e-7232.64%PREDICTED: uncharacterized protein K02A2.6-like, partial [Oreochromis niloticus]
Group
KEGG pathway 
InterPro domain[274-380] IPR0211092.3e-07Peptidase aspartic
Orthology groupMCL19841 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207491-TA
ATGGCTACGAACGACGCAGTATCTGGAATAAAGCCGCCGGCTAACCTACAAATTGACTCGGACAAATCAACGTGTTGGAAAAACTGGTTACAACAGTTTGAGTGGTATGCAATCGCTGTTCAGCTAGATAAAAAGCCAGCCGATGTCCAGGCAGCAACATTTATGGCAGTAATAGGCCCAGAAGCGATCGAAATATACAATAGTTTCAATCTGAGCGATACAGAAAAAAAAAACCTGGCCGCCATCAAAAACAGGTTCAAAGAGTATTTTGCACCAAAAACAAATATATCATTCGAAAGATATATATTTTTTAAGATTGAACAAAATGAAGATGAATATTTTAACGAATTTTTAACCCGAATAAAAACTCAGGCAATCAAATGCGAATTTGACAATTTACTGGACGAAATGTTAAAGGACAAAATAGTTTTTGGTATCCGATCCAATCAAGTCAGAGAGAAATTGTTGACAGAGGATAAATTAGACTTAACCAAAGCTATTAATATCTGTAAAACTAGTGAACAAGCGTCCAAACAATTAGATGAGTTTGAAAGCAAAAATAAAACAGATAAAGTACTAGTAGTCAAAAACAAAAGTGTCAAAAACAAAGAAAATGAAAATTTTGACTGCAAAAGGTGTGGTTCAAATCACAAACGCAGAGAATGTCCAGCATTTAACAAACCCTGTACCAAGTGCAACAAAAATGGTCACTTTGCCAAAATGTGCCGCACAAAGAACTATAATCCTAAATTAAAAAATAAAGTGAACACTGTAGAAGAAATGTCAGATTGTTCAGAAGACGAGGTATATATATCAGCTGTAAGTGGTGGAGATAAGAAAGACTGGACAGAAACAATACAAGTTGGTAAAATAAAATTTGCTGCCAAATTAGACACAGGAGCACAATGCAATGTGTTACCGAGACACTTGATGAACAAACTAAAGGCAAGTCTGAAACCAAGCCGAACAAAAAATTTAATAAGTTATACAGAAGATAAAATGGCAGTTCTAGGTGAAGCAGAGTTACTATGTAAAATAAAAAATGAAAAAAGAAACATAATATTCAAGATAGTTAAAGAAAAAGTTACACCAATTCTAGGACTTGATACATGTGAAAAATTAGAACTTATTGCACGAGTGAAAACACTGAAGGAAAGTGAATGTAAAAATGATATCTTCGAAGGCTTAGGATGTTATAAAAACTTTGAATATGATATTGACCTTATTGAGAACCCGAGACTGGAAATTAGACCGTCAAGAAGAATTCCACATGCTATAAGAGACGAGGTAAGGCAAGAACTTGATAGAATGGTTAAGTTAGATGTTATTAAACCAGAAACAGAGCCTACCCCTGCCGTGAGTCCTATGGTAATTGTGAGACAAAAAGATCCCCAAAATGCTAAAGAATATTCATCAAGGACATCTAGGTATAAACAGCTGTTTAAGAAGAGCACGACAACTTTTCTATTGGAAAGGACAGTACCACGACATTATAAAACTAGTCAAAACCTGCTCCATCTGTGA

Protein sequence:

>DPOGS207491-PA
MATNDAVSGIKPPANLQIDSDKSTCWKNWLQQFEWYAIAVQLDKKPADVQAATFMAVIGPEAIEIYNSFNLSDTEKKNLAAIKNRFKEYFAPKTNISFERYIFFKIEQNEDEYFNEFLTRIKTQAIKCEFDNLLDEMLKDKIVFGIRSNQVREKLLTEDKLDLTKAINICKTSEQASKQLDEFESKNKTDKVLVVKNKSVKNKENENFDCKRCGSNHKRRECPAFNKPCTKCNKNGHFAKMCRTKNYNPKLKNKVNTVEEMSDCSEDEVYISAVSGGDKKDWTETIQVGKIKFAAKLDTGAQCNVLPRHLMNKLKASLKPSRTKNLISYTEDKMAVLGEAELLCKIKNEKRNIIFKIVKEKVTPILGLDTCEKLELIARVKTLKESECKNDIFEGLGCYKNFEYDIDLIENPRLEIRPSRRIPHAIRDEVRQELDRMVKLDVIKPETEPTPAVSPMVIVRQKDPQNAKEYSSRTSRYKQLFKKSTTTFLLERTVPRHYKTSQNLLHL-