Monarch geneset OGS2.0

DPOGS209449
TranscriptDPOGS209449-TA3204 bp
ProteinDPOGS209449-PA1067 aa
Genomic positionDPSCF300275 - 134600-140806
RNAseq coverage36x (Rank: top 74%)
Annotation
HeliconiusHMEL0025980.072.79% 
BombyxBGIBMGA005866-TA0.076.28% 
DrosophilaCG40583-PB2e-6336.91% 
EBI UniRef50UniRef50_D6X3480.041.88%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6X348_TRICA
NCBI RefSeqXP_970543.10.041.88%PREDICTED: similar to CG40583 CG40583-PA [Tribolium castaneum]
NCBI nr blastpgi|910905320.041.88%PREDICTED: similar to CG40583 CG40583-PA [Tribolium castaneum]
NCBI nr blastxgi|910905320.041.28%PREDICTED: similar to CG40583 CG40583-PA [Tribolium castaneum]
Group
Gene OntologyGO:00055157.2e-33protein binding
GO:00038245.2e-16catalytic activity
KEGG pathway 
InterPro domain[128-687] IPR0110473.7e-36Quinonprotein alcohol dehydrogenase-like
[484-606] IPR0159437.2e-33WD40/YVTN repeat-like-containing domain
[540-936] IPR0110415.2e-16Soluble quinoprotein glucose/sorbosone dehydrogenase
[383-440] IPR0217171.9e-06Nucleoporin Nup120/160
Orthology groupMCL15017 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209449-TA
ATGGCAAACGATTGCGAAGTTAAGTCTATTATGTCTATATCATCTAAATCGGAAGGAAGCACGGCTCCGCTATCTCGCATTCGGTGGCGAAATGCTCCTCTTCACGAGAGATGCAGCATGGCTGACATGGTCAAGATTAAAGAAGCGTTCCAGGCAGCTTATAGAAATAAAATGCTACCAACAGAATTTAGAAATCTCTTGAGGACATTGTTGAATGTTGAATATGATGATGACGAATTTAATATACTTTTTATGAAGATAAATACGGCTCGCAACGGAGAGATCGATTGGGACGAGCTCGTGTCGCATCTACTTCTGGGCTATTTCGGCAATGACCCTGAGAACCAGAGAGCCTCCCTCCAGCTGCCGATTATGGGACTGCCGACTGTTATGAGATCACAGCATCGGCACCCGATATCGAGAATATGTTTCTGTCCTGACGACCGGAGCACGGATCCCATGCAGGGCAACTACATAACGGCGAGTCGCGATGGAATGATTAACTGGTGGTCCCTGGACATGTCGCTGTTGAGGACCGCCTTCTCTTCCAGTCCTCACCTGAAAGTGCGCACGACCTGGGTGACGGACATGGTCTGTATGCCGGACGTCAACATTATTGTGACGAGTTCTACCGAACGAGACCTGCGCTTCTATGACTGTACGGCGAAGACTTTCACCCTTAAGATCGTCATCACGAGCTGGGAGTACATGATCTGCTCCATGTACTACCACTTCTACAAGGACCCGAACGAGCGATGTATCCTGATCCTGGGCGACGTCGGTGGTCACGTGCGCGTGCTCACCTTCTCTCCCATAAAGCGCGGTCCCTTCCGCAACCAGCCCGGCCGGGCGCTGCAGCAACTGCGCCACGTCGACCTCCAGCGACGGCCCCACTTGTTGCCGGAGCTGAGGTTGACGGAGAAGGGTCGCGTCCACGGCGAGTGGGTCCGTCAGGTGTCGTTCTATTCGTCCCTGCACTGCATCGTGTCGTGTGCGACTTGCCCGGACTCTCTGCTCATGTGCGACCTCGCCGGTTCCAAGACCTATAACATGTTTCACGTCGAGAAGGGTATTCAGTGTTTCACATTCGACGAGGAGGCCCACATCCTGGCGACGGGAGGCCCGGACTGCACGGTCCGCGTGTGGAGTCCGTTCGTGCCGCGCGCCGCGAGCGTGTCGCTGACGGGGCACCACGCCGCCGTCACCGCGCTCGTGCTACAGATCTCCGCCACGACCTTGCTGTACTCGCTCTCCAGAGACCGCGTCATCAAGGTTTGGGACGTGCAGGGACAGGTGTGCCTTCAGACATATATAGACATCCCTCCGCAAGTTGGCGAGCGCACACCCATCTCCGCTCTATATAACCCGGTGACTCGCGAGTTCATAGTGGGAGCTATAAAGATCGCCGTGGTGGTCCTGGACGAACAGCTGAACCCCCTCCACACGGACGGGTTCACGCACTCGCGGGCCGTGTCCAAGATCCTCTACAATCCGCTGTTCAAGGTGGTCATCACGTGCGGCCTGGACAGCATCATCATCAACTGGGATCCGGTGACCGGTAAGCGCAACGCCATGGTCCGCGACGCCCACACCCGCCTGCTGCACGGCGAGTCCATCCCAGTGGAGATCACGGCCGCCTGCTTCGACCCCGGCTACCAGCTGCTGCTCACCGGCGCCAGGAACGGGACACTCAAGGTGTGGAACTTCAACACGGGGATATGCTTGAGGAGTATGGCCATAGAACACATGTGCGAAGTCACGAGCTGTTTTTGGGTCGAGGGGAGGATTCTAGCGGTGGGATGGAACCGACACGTCATAGAGTTCGAGGACGCGGGGGGGTCGGGAGGGTCCGGGGGGTCGGGAGCGGTGGAGGGGGGCGGAAAGGCCTGGGAGACCAGGCACAGCGACGACGTGCTGACGGCCGCCGCCCGTCCCCCGCTCACTCTCGCCACCGCCTCCTACAACTCAGAACTCATTCTGTGGAAGCTCGAGACGGGACAGCCTTACAGGCGATTCTCATGCACCGAGCCGATGCTGCGCATCAAGATGCAATACAGCAAGCGAGCTGCCTCGCCCCAGACGGCCTCCGCGACCTCGACCCGGCGGAGCACCTCCACCGTGAGGCGGCCGAGGTGGTCATCAGGTGTCTTTGGTTCATACGTGGGCGGTCAGCGGGAGTCCCTCATCGACTCGACCGCCGCCATGCAGAAGGCGCGAGCTCGGCGCGTGTCCACGGTGTCTCTGCCGGCGCGGGCGCAGAAGATGCGGCAGCTCGCCGTCCACGCCACCATATTCCTCACCACGCGACCCTGCCACATGCGCCCTCCGTCGCTCATGGTGGCTCTGGAGAACGGCCAGGTGCAGTGCTGGTCGGACCACCCGGCCGGCGGCCTGCAGGGCTCGTTCCAGGGCATCCACACGGCGGGCGACTACGTCTCCGCCTTCGCCACCGACGTGGCCAACGACTACCTCTTCGCCGGGACCACGGTCGGCTACATCAAGATCTGGCTCATGACCAACTACCTCACCAACGAGGAGGTTCACGTTAACATGCCGAAGCTGCGGCTGACGTTCCCGTTCCTGTGGCGGGACCGTATCGAGGGACGCGCCAAGCGCTGCGTGCGCGACCAGCCGCTGCCCCTGCTGCTGAGCAGCGTGCGCGCCCACCTGCGCTGCATCACGGCGCTCGCCTACATCGACGACCTCCGCCTCGTCCTCAGCGGTTCTTCGGATTACAGCGTGAGAGTGTGGAAGCTGTCCGGGGAGTACCTGCAGACGTTGGGTAGCTTCTTGCCGTGGACGTTGGAGGTCACTCGTTTTCCTCCCGACGTCAAGAAAGTCGCCAGCTTTACAACGTTCAAGGTGTGGCGCGGCGGATGGGTCTCCCGCTACACCCCGGGCCAGGTGGAGGTGGACCTTCTCCGCGACATCACCTCGCACGAGCTCGCCACCCGCACGTTCGGCGCTCCGGCCCCCGCGCCCCTGCTGGGTCACCACCTCGCCCTCCCCCGTAAACCTGACACTCAGCCGCCGCCGGGACTCGACGACTCGCTGCCCACGATCCCGCTGTACACCCACCTCCGCATGTCGTCGACGCAGCCCGTCCGCCGCATGGCGACTCCGCCCCTGGCGCGCGCCACTCGCCTCCGCCGCGCCGCCGCCGCCCGCGCGCCCAAGACTCACTTCCACGGCGACAAGTCATGA

Protein sequence:

>DPOGS209449-PA
MANDCEVKSIMSISSKSEGSTAPLSRIRWRNAPLHERCSMADMVKIKEAFQAAYRNKMLPTEFRNLLRTLLNVEYDDDEFNILFMKINTARNGEIDWDELVSHLLLGYFGNDPENQRASLQLPIMGLPTVMRSQHRHPISRICFCPDDRSTDPMQGNYITASRDGMINWWSLDMSLLRTAFSSSPHLKVRTTWVTDMVCMPDVNIIVTSSTERDLRFYDCTAKTFTLKIVITSWEYMICSMYYHFYKDPNERCILILGDVGGHVRVLTFSPIKRGPFRNQPGRALQQLRHVDLQRRPHLLPELRLTEKGRVHGEWVRQVSFYSSLHCIVSCATCPDSLLMCDLAGSKTYNMFHVEKGIQCFTFDEEAHILATGGPDCTVRVWSPFVPRAASVSLTGHHAAVTALVLQISATTLLYSLSRDRVIKVWDVQGQVCLQTYIDIPPQVGERTPISALYNPVTREFIVGAIKIAVVVLDEQLNPLHTDGFTHSRAVSKILYNPLFKVVITCGLDSIIINWDPVTGKRNAMVRDAHTRLLHGESIPVEITAACFDPGYQLLLTGARNGTLKVWNFNTGICLRSMAIEHMCEVTSCFWVEGRILAVGWNRHVIEFEDAGGSGGSGGSGAVEGGGKAWETRHSDDVLTAAARPPLTLATASYNSELILWKLETGQPYRRFSCTEPMLRIKMQYSKRAASPQTASATSTRRSTSTVRRPRWSSGVFGSYVGGQRESLIDSTAAMQKARARRVSTVSLPARAQKMRQLAVHATIFLTTRPCHMRPPSLMVALENGQVQCWSDHPAGGLQGSFQGIHTAGDYVSAFATDVANDYLFAGTTVGYIKIWLMTNYLTNEEVHVNMPKLRLTFPFLWRDRIEGRAKRCVRDQPLPLLLSSVRAHLRCITALAYIDDLRLVLSGSSDYSVRVWKLSGEYLQTLGSFLPWTLEVTRFPPDVKKVASFTTFKVWRGGWVSRYTPGQVEVDLLRDITSHELATRTFGAPAPAPLLGHHLALPRKPDTQPPPGLDDSLPTIPLYTHLRMSSTQPVRRMATPPLARATRLRRAAAARAPKTHFHGDKS-