Monarch geneset OGS2.0

DPOGS210822
Transcript: DPOGS210822-TA; 2736 bp
Protein: DPOGS210822-PA; 911 aa
Genomic position: DPSCF300027 - 487264-492220
RNAseq coverage: 306x (Rank: top 37%)
Annotation
Heliconius: HMEL012761; 5e-160; 85.30%
Bombyx: BGIBMGA007137-TA; 0.0; 62.95%
Drosophila: dor-PA; 1e-154; 37.13%
EBI UniRef50: UniRef50_E2BRY6; 0.0; 51.78%; Vacuolar protein sorting-associated protein 18-like protein n=13 Tax=Endopterygota RepID=E2BRY6_HARSA
NCBI RefSeq: XP_974055.1; 0.0; 51.91%; PREDICTED: similar to Vacuolar protein sorting-associated protein 18 homolog [Tribolium castaneum]
NCBI nr blastp: gi|270012453; 0.0; 46.61%; hypothetical protein TcasGA2_TC006604 [Tribolium castaneum]
NCBI nr blastx: gi|380027571; 0.0; 52.17%; PREDICTED: vacuolar protein sorting-associated protein 18 homolog [Apis florea]
Group
KEGG pathway 
InterPro domain: [234-385] IPR007810; 1e-41; Pep3/Vps18/deep orange
Orthology group: MCL13704; Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210822-TA
ATGACTTCAATTTTTGATCAATACGAACAAGCGTCTCAGGTGTCGCAACGCCTAGTTCCTCCTTCAGAACAGATGACGTCTTCAGGTTACATCAACATCCAACTGGATGATAACAAGCCCATGTTTTCTAAGAGACAAATGAATTTCACGCCATCAGATTTAATCACTCACGTTGCTGTTTCTAGTGATTATCTAGTTCTAGCCATGGCAAATGGAATGATATTTAGACTAGATAGTGCAGAGGCAGCATTGGCTGGTCTTGGTGGTGTATGTGAGTTGTTGTACCGCCGATACGCACCACCTTCCTCCGCTGCCACCGCCCTGAGACTGCATCACTGCTCTTTAAGACTATTGAGACATCTGGCTCACCATAACGCACACATGCACCGAGCACCTATAGAGTATATTTTTGATATAGGTAAGGGAACTGACACACCCATTACTGGTATCCAGTTCCATAGAGTGAACAACACAACCAAATTCTTCATATTTGTCACAACCCCCAAAAGGCTGTATCAATTCATCGGCCATGCTATGGCTTCCGATGAGAAACCATTCCTGCAATCAATATTTCACTCTTATCTAACAACCGTAGAGACAGGCTTCCATGAAATACCTTCAACTCTGAAATACTCCAAATTGCAGTTCTTTTTTGATAAAACAAATAGTCCAAAAACATTTGCCTGGTTGACGGAGCCAGGTATATTTTATGGGCAGCTAGATCCTACTTCTCAACAGAATTCCAACTCACTGTTCACTCAAGGCGAGCTCATAACTTACTCTGATAAAAGTGAAAAAAATGACACCAAAGAAGCAACGCCACTCTCATTCGTACTTACAGAGTTCCATGTCCTCCTCATGTATTCTGACAGGGTCAAAGCGGTGTCGCTGCTGAACCAAAAACTGGTATACGAAGACAGATACTCAGAAGTACATGGAAAGTTGAAGAATATAGTGAAAGATCCTATCGGAAAAACGATTTGGACCGTGACCGATAAAGCCGTTTTTAGATATAAGGTCGAGAGGGAAGAAAGAAATGTTTGGAGGATATACTCTGATAAGGAACAATTCGACCTGGCCAAGCAATACTGTCAAAATAATCCAGCCTATATAGATATAATAAACGTGAAACAGGCAGAACTATTGTTCAAGAAAGGCGATTACGATAAAAGTGCTGAAATATACGCGGAAACACAGAGCAGCTTCGAGACTGTTTGCCTCAAGTTTTTGGAATGCGATCAGGTTAACTCGCTGAAGGTGTACCTCAGTAAGAGATTGGACACTTTGGACGACGACAAGACCCTGATATCGATGATAGTTATTTGGATGACGGAGTTGTTCCTGTCGCAACTCGGGTCGCTCCGTCGCACCGGGAAAGCTGACTCAAACGAGTACCATCAGATCCAGAGCAATTTCGAGATCTTCCTTCTCCAACCCAAGGTCACGAAATGTATGCAACACATTAAAACTGTCATTTACGATCTGATGTCTTCACACGGAGATAAGCAGAACCTCATCAAGTTGACTATCATCAACGAGGACCACGAGAACGTAGTGGCGCAAAATATTTACGAGAAGTCGTACGTACAGGCTCTGAACATGCTGCAGCATTTGAAAAAACCCGATCTATTCTATCAGTTCGCTCCGGCCCTGATGGAAGAAATACCGAGAGAAACCGTCAACGCCTTGATTTCTCTTGGACCGATTCTAAGTTCCTCAAGATTGTTGCCGGCGTTCCTCTCCTGCGAAAACGACGAGGCTCATGTATCTGAAATCATTCGATACTTGACATTCATGCTACAGAATTACAATGTCAAGGATCGTGCGATTCATAACTATCTGTTGACGCTGTACGCGGAACACGACGTGCCGGCTCTCATGAGATATCTGTCACGGCAAGGGCAGGAGCTGTCGATGGTGAACTATGACGTACATTATGCCTTACGACTCTGTAGAGAGAAGAACTTAACGGAAGCGTGCGTGAAGCTGTCGGCGCTGCT
CGGCCTTTGGGAGTCGGCGGCTGAATTAGCGCTGCAAGTTGATACGGGCCTGGCCAAGACTGTGGCCGACATGCCTGATGATGTGACGCTGCAGAGGAGATTGTGGCTTGGAGTCGCGGAACACGTTATCACCAAGAACCAGGACATCAAGGTCGCCATGAGTCTTCTCGAAGAATGTCCTCTGATCAAAATCGAAGATATCTTACCATTCTTCAGTGACGTCATTACTATTGACCATTTTAGGGAACCCATCTGTCAGTCCTTACAGGAATATAACAATCAAATAGAAGAACTCAAAGCGGAAATGGAGGACGCCACGAAGTCAGCCGAGTATGTCCGCAGCGAGATCCAGTCGTTCCGTGGTCGGAGCGCGTTGGTGTGTTCGTCAGACACGTGCTGCGTGTGTTCGCTGGCGCTTCTCCTGCGACCCTTCTACCTGTTCCCTTGCAGCCATCGCTTCCACAGCGACTGTCTCCGGACCGAGATACTGCCGGTGCTGGCGCCCGCACGTCGCAATAAGCTAACGGATCTTCAGAAACAGCTGACGCTGCTGTCTAACATAGAACTGTCGACGGTGACGTCTAGTGGCCTTCCGCTCAGAGAAGTGTTGAAGAACGAGATCGATGACATAGTGGCCAGCGAGTGCCTCTACTGCGGGGAGTACATGATCACTTGTATCGATAGACCATTCATCGCCGACGAGGACTGGGACCGGGTTATGAAGGAGTGGGAATGA

Protein sequence:

>DPOGS210822-PA
MTSIFDQYEQASQVSQRLVPPSEQMTSSGYINIQLDDNKPMFSKRQMNFTPSDLITHVAVSSDYLVLAMANGMIFRLDSAEAALAGLGGVCELLYRRYAPPSSAATALRLHHCSLRLLRHLAHHNAHMHRAPIEYIFDIGKGTDTPITGIQFHRVNNTTKFFIFVTTPKRLYQFIGHAMASDEKPFLQSIFHSYLTTVETGFHEIPSTLKYSKLQFFFDKTNSPKTFAWLTEPGIFYGQLDPTSQQNSNSLFTQGELITYSDKSEKNDTKEATPLSFVLTEFHVLLMYSDRVKAVSLLNQKLVYEDRYSEVHGKLKNIVKDPIGKTIWTVTDKAVFRYKVEREERNVWRIYSDKEQFDLAKQYCQNNPAYIDIINVKQAELLFKKGDYDKSAEIYAETQSSFETVCLKFLECDQVNSLKVYLSKRLDTLDDDKTLISMIVIWMTELFLSQLGSLRRTGKADSNEYHQIQSNFEIFLLQPKVTKCMQHIKTVIYDLMSSHGDKQNLIKLTIINEDHENVVAQNIYEKSYVQALNMLQHLKKPDLFYQFAPALMEEIPRETVNALISLGPILSSSRLLPAFLSCENDEAHVSEIIRYLTFMLQNYNVKDRAIHNYLLTLYAEHDVPALMRYLSRQGQELSMVNYDVHYALRLCREKNLTEACVKLSALLGLWESAAELALQVDTGLAKTVADMPDDVTLQRRLWLGVAEHVITKNQDIKVAMSLLEECPLIKIEDILPFFSDVITIDHFREPICQSLQEYNNQIEELKAEMEDATKSAEYVRSEIQSFRGRSALVCSSDTCCVCSLALLLRPFYLFPCSHRFHSDCLRTEILPVLAPARRNKLTDLQKQLTLLSNIELSTVTSSGLPLREVLKNEIDDIVASECLYCGEYMITCIDRPFIADEDWDRVMKEWE-