Monarch geneset OGS2.0

DPOGS205913
Transcript        DPOGS205913-TA  1662 bp
Protein           DPOGS205913-PA  553 aa
Genomic position  DPSCF300089 + 265511-279802
RNAseq coverage   496x (Rank: top 25%)
Annotation
Heliconius      HMEL009448        2e-98   93.75%
Bombyx          BGIBMGA007012-TA  0.0     90.68%
Drosophila      CG31145-PC        4e-156  70.23%
EBI UniRef50    UniRef50_B4JIB4   2e-157  70.74%  GH19066 n=7 Tax=Endopterygota RepID=B4JIB4_DROGR
NCBI RefSeq     XP_002032363.1    1e-160  52.29%  GM23552 [Drosophila sechellia]
NCBI nr blastp  gi|195331345      2e-159  52.29%  GM23552 [Drosophila sechellia]
NCBI nr blastx  gi|347968840      7e-157  54.38%  AGAP002913-PA [Anopheles gambiae str. PEST]
Group
KEGG pathway 
InterPro domain  [193-540] IPR009581  1.1e-230  Protein of unknown function DUF1193
Orthology group  MCL13697  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205913-TA
ATGGACTACGGCATCTCCGGACACAGGGTACCATTGCATGGACGTGTCAAAATAGGCGATGACACGGATAAAGGAAGATCGGCGTATATAGAGTTTAGAAAAAGATTTCTACAGAAAAGCAATGGTAGCAATGGTTCACGAGAGTACGAGCAGTCGGCTCCGAGTGAGACGAGCGGTGGCGACGCTGAACCTAGGACGCCGGCGACACCTGCCGATAGGTTCGAGGACCTGCAGAGGATACTGGTCTTCCAGTTGCAGGGGAAGTCTGATGACAACCCTGTGGTGGTGCCACCCCATCGAGATCTGAACGTCTTAAGACCCGAGAACCCTACCATCGGTGAAATGGAGGACTTGGAACCAAGAATAGCTGAATATGTCTCGAAAGCCGTCTTTTCATCTACTACGGAGCAGGCTCAGAGCTCTGAAGTATGCATCAAAACCAAACAGGTTATCGGAGTATGCCACAAATCGAGAACGTTACATGTAGGCTCCTTAGTATTCAGTGTCGCACCGTCGCCTGGAGTGCCGCCGCAACCAACTCACGTCCACCCATGTACCACGCAGCCAAAACCAACACGCTCAGGAAAATCTTCCATTGTAAATGCCTCAAATCTGGAGAAATTTCAATTGAAAATAGCGCAACACGAGCTCTATGAAGACGGAGAGCCGCTGGTCAGCGCCATTCTTCGCGACATGACCTTTGAACCCATCCTACATGTTGAACAAAAGGAAGGAGGAACGCAACTGAAGCTCATCATCGACTATCCAAATGGCGTGCAGGCTTTATTCAAACCGATGAGGTTCGCCCGGGATGTACAAACTCTACCTAATCACTTCTACTTCTCAGACTATGAGCGGCATAACGCTGAAATTGCTGCTTTTCATCTTGACAGGATACTCGGTTTCCGTCGAGCGATGCCGGTGGTGGGTCGAGTTGTGAATATGACCACTGAGATCTATGACGTCACTGAGGGAGACATCTTGAAGACGTTCTTCGTATCTCCAGCGAACAACTTCTGTTTCCACGGCAAGTGTTCGTACTATTGCGACACAGGACACGCGATATGCGGCAATCCGGACATGTTAGAAGGCAGCTTTGCGGCTTTCCTACCAACCTCGGATCTAGCGGAGCGTAAGGTGTGGAGACATCCCTGGCGAAGATCTTATCACAAACGAAGGAAAGCTCAGTGGGAGCTGCAGTCCGACTACTGTGATACGGTCCGCAGTACTCCGCCCTACGACTCCGGCCGTCGTCTGTTAGACCTCATAGACATGTCGATATTCGACTTCCTCACTGGGAACATGGACAGACATCACTATGAGACATTCAAAATGTTCGGCAACGAGACTTTTACGTTGCACCTGGACCAGGGACGAGCTTTCGGTAAGGCGTTCCACGACGAGCTCAGCATACTCGCGCCACTGCTACAGTGCTGCACCGTTAGACACACTACGCTTGCAGTCCTGCTTAAATTCCATAACGGCGTGCCATTATCGAAAGTGCTCCGAGATTCTATGAAAGCTGACCCCGTGAATCCCGTGCTTTGGGAGCCTCACCTGGCCGCGTTAGACAGGCGTATAGTTACAGTACTGGACGCGATCAGGAAATGCATAGATAAATTAGAAAATCCTCTACCGAATGAACTGAACTCTGTCGTGTGA

Protein sequence:

>DPOGS205913-PA
MDYGISGHRVPLHGRVKIGDDTDKGRSAYIEFRKRFLQKSNGSNGSREYEQSAPSETSGGDAEPRTPATPADRFEDLQRILVFQLQGKSDDNPVVVPPHRDLNVLRPENPTIGEMEDLEPRIAEYVSKAVFSSTTEQAQSSEVCIKTKQVIGVCHKSRTLHVGSLVFSVAPSPGVPPQPTHVHPCTTQPKPTRSGKSSIVNASNLEKFQLKIAQHELYEDGEPLVSAILRDMTFEPILHVEQKEGGTQLKLIIDYPNGVQALFKPMRFARDVQTLPNHFYFSDYERHNAEIAAFHLDRILGFRRAMPVVGRVVNMTTEIYDVTEGDILKTFFVSPANNFCFHGKCSYYCDTGHAICGNPDMLEGSFAAFLPTSDLAERKVWRHPWRRSYHKRRKAQWELQSDYCDTVRSTPPYDSGRRLLDLIDMSIFDFLTGNMDRHHYETFKMFGNETFTLHLDQGRAFGKAFHDELSILAPLLQCCTVRHTTLAVLLKFHNGVPLSKVLRDSMKADPVNPVLWEPHLAALDRRIVTVLDAIRKCIDKLENPLPNELNSVV-
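
The transcript and protein lengths listed above can be cross-checked against each other: a 553 aa protein plus one stop codon should correspond to 3 x (553 + 1) = 1662 bp. The following is a minimal sketch of that check in Python, assuming the standard genetic code and using only the first 15 and last 27 nucleotides of DPOGS205913-TA as copied from this record. The names `CODON_TABLE` and `translate` are illustrative and not part of the database. The sketch prints the stop codon as `*`, whereas this record writes it as `-`.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; any trailing partial codon is ignored."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# Fragments copied from the DPOGS205913-TA sequence in this record.
first_15_nt = "ATGGACTACGGCATC"
last_27_nt = "CCGAATGAACTGAACTCTGTCGTGTGA"

print(translate(first_15_nt))  # MDYGI
print(translate(last_27_nt))   # PNELNSVV*

# Length consistency: 553 residues plus one stop codon, three bases each.
print(3 * (553 + 1))           # 1662
```

Both fragments translate to the corresponding ends of DPOGS205913-PA (`MDYGI...` and `...PNELNSVV` followed by the stop), which is consistent with the 1662 bp figure covering the full coding sequence including the terminal TGA.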