Monarch geneset OGS2.0

DPOGS210539
TranscriptDPOGS210539-TA2475 bp
ProteinDPOGS210539-PA824 aa
Genomic positionDPSCF300304 - 157250-161192
RNAseq coverage1220x (Rank: top 10%)
Annotation
HeliconiusHMEL0095490.071.65% 
BombyxBGIBMGA013462-TA0.067.24% 
DrosophilaALiX-PA0.049.22% 
EBI UniRef50UniRef50_Q9VB050.049.22%ALG-2 interacting protein X n=21 Tax=Neoptera RepID=Q9VB05_DROME
NCBI RefSeqXP_973055.20.053.48%PREDICTED: similar to programmed cell death 6-interacting protein [Tribolium castaneum]
NCBI nr blastpgi|1892343260.053.48%PREDICTED: similar to programmed cell death 6-interacting protein [Tribolium castaneum]
NCBI nr blastxgi|2700019050.053.98%hypothetical protein TcasGA2_TC000807 [Tribolium castaneum]
Group
KEGG pathwaytca:6618280.0 
 K12200 (PDCD6IP, ALIX, RIM20)maps-> Endocytosis
InterPro domain[4-351] IPR0043285.6e-110BRO1 domain
Orthology groupMCL14192 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210539-TA
ATGGCCGATTTGATAGTCGTCCCGTTGAAAAAATCATCTGATGTAGATATAGTGAAGCCTTTAAAAAACTTAATTCAATCCACATATAACACCGCAGAAACCAACGAAGATTTTAGTGACGCTATTAATGAGCTGAGTCGACTCAGAAGTTACGCTATATGGAAAGTGTTTGAAAAGTCGTCATTAGATGTTATTTACAGCTACTATGATCAACTTGTGTGTCTGGAGCGGAAGATCCCGCCGCAAGAAGTGCAGATCCCCTTCAAATGGAAAGATGCTTTTGACAAAGGTTCAATTTTTGGTGGCAGGATGAGTCTTACCATCAGTTCCTTGGCTTATGAACGAGTGTGTATAATGTTCAATATGGCAGCAATGCAAAGTGCTTTGGCAGCACAACAGACTCTTGATACAGAGGATTCATTGAAACTCGCTGCTAAGTTGTTGCAGCAATCTGCCGGTATCTTTGCCTACTTGAAGGCAAATATAATGCTGGTAGTACATCAGGAGCCCACACCCGACCTTAGTCCTGATACATTATCTACTCTCTCACAAGTGATGCTGGCTCAGGCTCAGGAAGTCATTGCATACAAGTGCATGAAAGACGAAATGAAGGAGAGTATGGTGGCGAAGGTGTGTGCGCAGTGTGAGGAGCTGTACACGGACGTGGTGCGCGCCCTGCAGAGAGACTCCAGGCAGTGGTTCGACCGCGACTGGCTCCTCACGGCTCAGAGTAAACAATCGATCTTCCGCGGTCTCGCGAACTACTATCAGGCGCAGGTTTGTCGCGCGAACAAATCGTTCGGGGAGGAGATCGCCCGGCTCAAGGTTGCGGTGGAGCTGCTGAGGGCGGGGCGAGAGGGCAGCCCCGCGGCCGAGGTGGGAGTCAAGGCGGCGAGGCAGTTGGCTGCCGCCGAGAGGGACAACGACTTCATTTATCACGAGCGGGTGCCTGAGGCGCGCGCTCTGGAGCCGGTGTCGCGCGCTCCCGTGGCCTCGGCCCTGCCGCCTCAGGAGCGCTGGTCTCCCGGGCGACAGGATCTGTTCGAGAAGCTGGTGCCTCTGGCCGTGCATCAGGCGCTGCAGGCCACGGACGCACGCCGCGCCGACCAAATCGGGGCGGAGATCAACGCTCTTCGGGAGGACACGCAGATGCTCAACAGCATTCTGGCGTCCTTGGACCTGCCCGCCTGCATCGAGGCGGCGGGTGGCGGCGGCGGACTGCCCGACTCCATCCGAGCCAAGGCGGCGGCCGTCAGAGACGCGGGCGGACTCCCCGCCCTCGAGCGGCTTATGGCCGAATTACCGGAGCTGTTACAGCGGAACCGGGAGATACTGACGGAGGCGGAGCGCATGCTGAGAGAGGAAGCGGAAGCGGACGCGGCCTTGAGAACGCAGTTCGGGACGAGGTGGACGCGGACGGAGTCGGACAAGCTGACGGAAGCGTTCCGCGCCAACGCTTCGAAATACGCGCAGATCATCGACAACGCCGTCCGCGCCGATAGCATCGTGCAGCAGAAATTCCATCAGCACAAAGAGAGTATCGAGATGCTGAGCCGCAGCGACCGAGAGATAGGAGCGGACGTCCCCGACGCGCCGGACTCCGTGGCCACGGACGCGGACGCGCTGCGGACTCTGAGGACGCTCATGCAGTCGGTCGAGGAGCTGAAGGCGGAGCGCGACGCCATCGAGTCCGAGCTGAAGAGTGCGACGGTGGACCTCCGGGAACAGTTCATGTCAGCGCTGGCCGCGGACGGTGCCGTGGACGAGGCGCGGCTGTCGGCGGGCGCGCTGGGCGCCGCTCTCGCCCCGCTACAGCGTCGCGCCGCCGCCGCGCGCTCTCGGCAAGAGGAGCTCGCGGGGCGGCTCCGGGAGGCACACGACGCCCTCATGGCGGCCCGCGGGGGAGCCTCGGGGAGGGAGAGGGCCCTCACGAGACTCAGCGCGGCCGGGGACGCCTACCAGGACCTTACCAACAATCTCAAGGAGGGCGTCAAGTTCTACAACGACCTCACTCAGCTGCTGGTGGCCTTCCAGAACAAGGTGTCGGACTACTGCTTCGCTCGCAAGACGGAGAAGGACGAGCTGCTGAAGGATCTCACGCAGGAGGCGTCTCGCTCGTCCCCGCGACCCGCCCCCGCCCCGCCGCAACATCACGTGGAGGCCAACGAGCCGGCCGTAGTGTCCAAGCGTGAGCCGCCTCCCCGACCTCCCCCGCCCGCCGCCGCGCCCGCCGTGTCCGCGCCGTCGCTGCCCTACCCGCAGCAACCTCAAGGCATGCCGCTGCCGTACGGTGCGGCGCCGTCTCCGTATCCGGTGTACGCCCCGATGCCGGCCATGTACAACCCGTACGCGACGCTGCCGTACCCGCACCACCGCGGCGCCGCGCCGTCGTACCAGCCCTACCAGTACCCCCCTCCGGCGGCCGCCTACCCCCCGCCGCCGCCCGCCGGGTACAACCCCTACCAGCAGCAGTAG

Protein sequence:

>DPOGS210539-PA
MADLIVVPLKKSSDVDIVKPLKNLIQSTYNTAETNEDFSDAINELSRLRSYAIWKVFEKSSLDVIYSYYDQLVCLERKIPPQEVQIPFKWKDAFDKGSIFGGRMSLTISSLAYERVCIMFNMAAMQSALAAQQTLDTEDSLKLAAKLLQQSAGIFAYLKANIMLVVHQEPTPDLSPDTLSTLSQVMLAQAQEVIAYKCMKDEMKESMVAKVCAQCEELYTDVVRALQRDSRQWFDRDWLLTAQSKQSIFRGLANYYQAQVCRANKSFGEEIARLKVAVELLRAGREGSPAAEVGVKAARQLAAAERDNDFIYHERVPEARALEPVSRAPVASALPPQERWSPGRQDLFEKLVPLAVHQALQATDARRADQIGAEINALREDTQMLNSILASLDLPACIEAAGGGGGLPDSIRAKAAAVRDAGGLPALERLMAELPELLQRNREILTEAERMLREEAEADAALRTQFGTRWTRTESDKLTEAFRANASKYAQIIDNAVRADSIVQQKFHQHKESIEMLSRSDREIGADVPDAPDSVATDADALRTLRTLMQSVEELKAERDAIESELKSATVDLREQFMSALAADGAVDEARLSAGALGAALAPLQRRAAAARSRQEELAGRLREAHDALMAARGGASGRERALTRLSAAGDAYQDLTNNLKEGVKFYNDLTQLLVAFQNKVSDYCFARKTEKDELLKDLTQEASRSSPRPAPAPPQHHVEANEPAVVSKREPPPRPPPPAAAPAVSAPSLPYPQQPQGMPLPYGAAPSPYPVYAPMPAMYNPYATLPYPHHRGAAPSYQPYQYPPPAAAYPPPPPAGYNPYQQQ-