Monarch geneset OGS2.0

DPOGS210722
TranscriptDPOGS210722-TA2661 bp
ProteinDPOGS210722-PA886 aa
Genomic positionDPSCF300013 - 80280-87125
RNAseq coverage539x (Rank: top 23%)
Annotation
HeliconiusHMEL0024190.060.63% 
BombyxBGIBMGA006333-TA0.060.41% 
DrosophilaCG2943-PA9e-16338.63% 
EBI UniRef50UniRef50_E2BNN61e-17238.91%Uncharacterized protein KIAA0090-like protein n=7 Tax=Formicidae RepID=E2BNN6_HARSA
NCBI RefSeqXP_624458.13e-17838.74%PREDICTED: similar to CG2943-PA [Apis mellifera]
NCBI nr blastpgi|1571381851e-17538.41%hypothetical protein AaeL_AAEL003785 [Aedes aegypti]
NCBI nr blastxgi|1571381856e-17338.49%hypothetical protein AaeL_AAEL003785 [Aedes aegypti]
Group
KEGG pathway 
InterPro domain[683-885] IPR0116782e-47Domain of unknown function DUF1620
[23-511] IPR0110471.1e-09Quinonprotein alcohol dehydrogenase-like
Orthology groupMCL13635 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210722-TA
ATGAAATGGTTTATTATATTCTTTAGTTTAATCAATCTCTCAGTTTGTATATATGAAGACCAAATCGGCAAATTTGACTGGCGTCAGACATACGTTGGTCGTATAAAATTTGCACAGTTTGATACAGTTTCTACGGCAAAAAAAATTATAGTTGCTACCGAAGAAAATGTACTAGCAGCTCTTCATTTAAAGACTGGTGAAGTTGTCTGGCGGCATGTATTTGAAAATGCTTCACCTGGTAATATACATCTGCTGCATGTGGGAGAGAAGATCACCACAGTGACCGGAGATAATCCAATGAATCAGTCTCCTACCCTGAGCGCTACAGGTGTACTGATGTGGGAGTGGACTCTCATGCTGCAAGATGACAGTAGAGCTGATTTCTCCGAGTGGTGGGTGCAAAACGGAATGCTTGTGCACATGCTGCCTGTCTTTAATTCTCATCTTGAAGTCACTATGTACAATGTGATGTCGGGAAGCAACAGAGGTGCTACATCTAAGTTACCAGCTATCTGGAATAATGAAGGGCAAGTAATTTGCATTTGTGTCCTCACTGCCCCCTATTACACTTGTGTGTCGGGTGAATTTGGAAGTCAAATCTTGGTGTCAATGGATGTCACAGCGAATGCCATACAAATGATAAGCAAACCGCTATCGAACATCATTGAAGGTGCTGTCGGTAACTTGCGTGCATTAGACGGTAACAGTGTCATCCCTGGCTTCATAGTGGACGACAAGAAGATAGTGCTGATTAAGGAAAATGATTTTAACGTTTTGAACGTCAAGGTTGAAGATACCTTAGCGAGTGCAAGTATCGCCGATGGTGCTAGAGGTCCTCTCGTGCTGCAGATATGGACGAATTATGCTAAAGGCTACCAACTCATAGCACACACACTGTCTGGTCAATTAATACCAGAAATACACAGCCCTGATTCATTCCTCGATATCCCCGAACCGGAGTTGTTAGCGGTGACATGTGCCCGCGACCAGCTCGCCTGTCGCTTGCTCATCAACGCTGCCGATGACGCTGTACATTTGGTACAGCAAGGAGGTGTAACGTTATGGTCTCGTGAGGAATCCCTGGCTAATATAAAAAGCGTGGAGTTTGTGGACCTTCCAGTGTCAGATGCTGATGCTGCTCTGGAATCAGAGTTTGATCAGAAAGAAGGTTCCGTGTGGTGGTCGTTCGTCCGTCGCCTCCAGTCGCAGTACCAGCAGCTGTCGGCTGCGGTGGAGCGTCTGCGGAGCGGGGAGGTGCTGGACCGGGGCTCCGCCTCCCTCCACCGGGACTACTTCAACCTGCACAGGATCATGGTCCTTGTCACGGAAGCTGGGAAGATCTTCGGTATGGACAACCTGTCGGGGTCGCTGGTGTGGCGCCTGTACTTACCGACCCTCTCCGGCGCCCGGACCATACTGCTGAGGAGAGCGGCGCGGCACCCTCACACAGCCATGATCACCATCGTCGGCACACACACGGACACAGGCAACGGTTACATAGTGACCCTTGACCCCATCACCGGGAGGATGGTCCCCGAGCACACCGTCACGCTGGACGTGGGGATAATGCAATGTATGACGTTACAGGAGACGGGAGACGATCAGCTGAGAGCTCTCATTGTACTGGACGAGGACGAAGCCGTTCAGGTGTACCCGCCGTCCGCCGCGTCGCTCGTCCACAATGTGCACATGTATGTAGCGGACCAGGACACGGGCAGGGTCAGGGGGTACGCGATCAGATATAACGGAAGGGAGGCGGTGGCCGAGCGGACCTGGTCGATGTCCCTGGGCGGGTCGGGTCCGGCTCGTATCGTGGCCATGTCGTCCCGGTCCCGCCTGGAGCGCGTGAGGTCCCCGGGGCGCGCGCTCGCCGACCGCAGCGTCCTCTACAAGTACTCCAACCCAAACATGCTGCTGTTTGTTGTCGAGAAACCCGATCCGACTCATAAAGAGGTCGTGACGGCGGTGGTGGTGGACGCGGTGTCGGGCGCGGTGGTGGGCGCCTCCTCCCACCGCCGAGCGCGCGCCCTGCCGCTGGCGGTGCACGCCGACAACTGCTTCGCCTACCTCTACAGGAGCGACAAGCACCGGAGAGTCGAGATAGCCACGATGGAGCTGTACGAGGGTAAGGACCGCTGGTCCCCGGCCGGCGAGCCGTTCAGCTCGTCCGCCAGCTGGCGCACGCCGGTGGTGGAGCGTCAGGCGTACATCCTGCCCGCACTGCCCTCCGCCGCCGCCTTCACCATCACCGAGAGATCGCTCACCGACAGACACGTGCTCTTGGGCCTGTCGTCGGGCGGCGTGGTGGAGGTACCGTGGTCGTTGGTGGAGGCGCGGCGCGGCGCGGCGGGCGAGGAGTCCGTGCTGCCCTACCTGCCCGAGCTGCCCCTGACGGCGGACCGCGTGCTCTCCTACAACCTCACCCTGCACCGCCTGGCCGCGCTGCACACCGCGCCCGCGGGCCTCGAGTCTACCAGTCTGATGCTGGCCACCGGACTGGATCTGTTCTACACGCGAGTGGCGCCCTCTAGGACGTTCGACCTGCTGAAGGACGACTTCGACTACTACCTCATAACGATAGTGCTGGCGGCGCTCGTGCTGGCAACGTACGGCACCAAGTACCTCGCCTCCAGGAAGACGCTCAAGATGGCGTGGAAGTGA

Protein sequence:

>DPOGS210722-PA
MKWFIIFFSLINLSVCIYEDQIGKFDWRQTYVGRIKFAQFDTVSTAKKIIVATEENVLAALHLKTGEVVWRHVFENASPGNIHLLHVGEKITTVTGDNPMNQSPTLSATGVLMWEWTLMLQDDSRADFSEWWVQNGMLVHMLPVFNSHLEVTMYNVMSGSNRGATSKLPAIWNNEGQVICICVLTAPYYTCVSGEFGSQILVSMDVTANAIQMISKPLSNIIEGAVGNLRALDGNSVIPGFIVDDKKIVLIKENDFNVLNVKVEDTLASASIADGARGPLVLQIWTNYAKGYQLIAHTLSGQLIPEIHSPDSFLDIPEPELLAVTCARDQLACRLLINAADDAVHLVQQGGVTLWSREESLANIKSVEFVDLPVSDADAALESEFDQKEGSVWWSFVRRLQSQYQQLSAAVERLRSGEVLDRGSASLHRDYFNLHRIMVLVTEAGKIFGMDNLSGSLVWRLYLPTLSGARTILLRRAARHPHTAMITIVGTHTDTGNGYIVTLDPITGRMVPEHTVTLDVGIMQCMTLQETGDDQLRALIVLDEDEAVQVYPPSAASLVHNVHMYVADQDTGRVRGYAIRYNGREAVAERTWSMSLGGSGPARIVAMSSRSRLERVRSPGRALADRSVLYKYSNPNMLLFVVEKPDPTHKEVVTAVVVDAVSGAVVGASSHRRARALPLAVHADNCFAYLYRSDKHRRVEIATMELYEGKDRWSPAGEPFSSSASWRTPVVERQAYILPALPSAAAFTITERSLTDRHVLLGLSSGGVVEVPWSLVEARRGAAGEESVLPYLPELPLTADRVLSYNLTLHRLAALHTAPAGLESTSLMLATGLDLFYTRVAPSRTFDLLKDDFDYYLITIVLAALVLATYGTKYLASRKTLKMAWK-