Monarch geneset OGS2.0

DPOGS209317
Transcript: DPOGS209317-TA (2823 bp)
Protein: DPOGS209317-PA (940 aa)
Genomic position: DPSCF300234 - 28895-35963
RNAseq coverage: 1343x (Rank: top 9%)
Annotation
Heliconius: HMEL018077; 0.0; 52.95%
Bombyx: BGIBMGA013819-TA; 6e-177; 91.53%
Drosophila: CG6282-PA; 4e-129; 72.13%
EBI UniRef50: UniRef50_Q7Q6P7; 2e-135; 71.75%; AGAP005737-PA n=5 Tax=Culicidae RepID=Q7Q6P7_ANOGA
NCBI RefSeq: XP_393140.1; 2e-136; 73.79%; PREDICTED: similar to CG6282-PA, isoform A isoform 1 [Apis mellifera]
NCBI nr blastp: gi|328784034; 9e-136; 69.58%; PREDICTED: hypothetical protein LOC409642 isoform 2 [Apis mellifera]
NCBI nr blastx: gi|328784034; 2e-131; 70.81%; PREDICTED: hypothetical protein LOC409642 isoform 2 [Apis mellifera]
Group
KEGG pathway: lla:L176238; 4e-23
 K12343 (SRD5A1) maps-> Steroid hormone biosynthesis
InterPro domain:
 [646-885] IPR010721; 2e-76; Protein of unknown function DUF1295
 [459-562] IPR000859; 1.8e-08; CUB
Orthology group: MCL16025; Insect specific

Nucleotide sequence:

>DPOGS209317-TA
ATGTTCAAAACTTCTATATCTTTGGCTCTCGTGGCCGTCGTGTTGAGTGTCGTTTTAGCTGATGACAGTGACCAGAAGGAGACCTGGTATGATGATTACTACTTAGGAGACCTTGACGATAAAGAGATCTTTCAGCCTTTTACATCGCTACATAGAAGTACTATATCAGATGGTAATGAAGACCTACCAGCTGAAATTAAAGACACCGATGATCTCTCCTCCGATATAAATGAATACAACGAAGAAAAGAAATTCGAGGATGAACTGAAAAGGTACGTGATTGACAGGAAAAGTGACGCTCAGGATGATAGTAGCTTTGATGACTACCAGTTCGAAGATAATAGATTTGGTGACGATAACTCATACGGTGATGATCAAAGTGATGAATTTAATCCACTTCTAAATAAAGAGTTACAAGTTGAGACTGACAAAGAAAAACATGTCGAAGCATCGACTGAACTGCTAAATGATTTATCAAAAACAAATAAGAGTACAGATAAAGATATTCTAGAGGAGACGAAATCAATTTTAAGTGAAGAGACGATAAACGATTTCGCTCATGATAAAACATTTGGAGCGAATGTTGTAAATGATACTGAAAGCGGGCATATATTAGAGGAGACCAGAAAAATTATAAACGAAAATTATGGATCAATAGAAGACAATATCGACAAAGATGGTGGTGAAGAATCGATCAGAGATTCTGAAACTGAAGAAGATCATAGGACAATGATAACAACATCCGAAGAAGGAGGAGAGAGAGAGGAAAAAGGAGACACCATCGATACAGCTTATAGTGATATAATGCACGATTTGAACAGACTCCATGGAACATGGAAAAAATTAAACGAAGTTGGCGATGACGTTGATGATAACGATTACAGTGAAAGTGAGGAACAACAGGAAATGGACGGAGACTACGAAGAAATTGGTGATTCAGAACTTGAGAGAATATTTGAAAATAGTTCCAACAATGAACAAGAGAACGTGGAAGCTTCTGATGAAATACTCTCTGACGAGACATCTGCAAGTCATGCAAACACTTATAAACCTCTGCCGTACATAAACGATGCGTTAGTGGCGAGCGATGTAGACAGAAGCTTACCAAAGAGTACTAACGATAGCAGTGATAGTAACAATGCTAGCAGCACAGAGGAGTCAAGTGTCACGGAGAGCTCCAAGGCTTCTGGAACCACTGAGTCCATGTCTACAACCGAAGCAGTGGAAACAACATTAAAATCAGAAATGAATGAAATGTCGATTGCCGAGGAGGACGCCGCCATCTTGAAGTTTACTGAAGTCAATCCCTCGATACTAGATGTCACATCGAGTGATATCATTAATGCAACTAACGTTTGGCTGACAGTGAACGGTTCCGTGGAAGTGACGTCACCCGACTATCCCTCTCCGTACCCCACGAATAATACCGTGGACTGGATGTTCCAAGGAGCCGGCCAAGGAATAGAATTAAATATAACGGAATTCTCCGTCAACGGTTACCTCGGGGACTACCTGTTGGTTAAACCAGGTGGAGTGGACACGTCGGGTTCTTCGGGCCTCATCTTCACCTACTCGCTACGGACCGAGCGTCGCTACAGGTTCTTGGACGTCGATAAGATGTTCGTGCGTTTCGTGGCCAAACCTGGAAACCAGTTATTCAGAGGGTTCAAGTTCAGCGCCCGTATGGTCGTCGATCGACCAGAGTCGATACCTGAACCCGAGGAGGACGTTCCCGCTCCCGTGTCTCCCGCCACCATCACTGTCAACCTGGGCGGGATCTCACTCCAGGATTTCCATGGGGTCGAAGAACAGTTCCGTCGGATCATCGCCGACATGGCCACCTTGTATATCAACACCAACGACATCGACGCTGGACTCAACGCTACGAACAACTTCGCAGTAAGTGCTATAGTAACCGTGGCCATGCAGATACTATTCTTCACCATCGCTTCTTTGTCTCAGAGTGATAAAGTAACAGATTTTACTGGAGGTGCTAATTT
TATTATTATAGCGTTATTAACATTCTTCCTCGGCCAAGGTGGGAACACTCTCAAGAACTATGACAGTAGGCAACTGATGGTCACCGCGTTCATATGCGTGTGGGGCGTGAGGCTGTCGGGGTACCTCATATACAGGATCTATCACATCGGCAGAGACAAGCAGTTCGAGGATCGTAAGAGCAACACTCTGAGGTTCGCCGTCTTTTATACTTTCCAAGCTGTGTGGGTGTACGTGGTCAGTTTGCCAGTCATCATTATAAACTCACCGCATCACTCCTACCCCAAGGCGCCGAAGACGATGACGACGTTAGACTCGGCCGGAGCCGGTGTTTTCGTCATCGGATTGTTGATCGAAACTTATGCAGATTTACAAAAATTTGCATTCAGACAGGAACCAGCCAATCAAGGAAGATGGTGCAACGACGGCCTCTGGGGACTATCACGACATCCCAATTATTTTGGTGAGGTCGTTCTCTGGTGGGGCATATTCATAATATCATTGAACATCATAGAAGGCGTCGAATACATTGCTGTGTTGTCACCATTGTTCACGACAGCAATAATATTGTTTTTATCTGGTATACCGTTACTAGAAAGATCAGCGGACGAAAAGTACAGAGATAACCCAGACTATCTATACTATAAAGCGTCGACGTCCCCCTTCATACCGATACCGCCCGCTATCTACGTCGAGGTGCCGAGGTTCCTGAAATGTATGTTGTGCTGTGAGTTCCCCATCTACGACTCCACCGGCGACGAGTTCCCCGCGCCTACCATCGTCACCGAGACCACGAGCATATCGATGGTGCAGTCGCAGACATAG

Protein sequence:

>DPOGS209317-PA
MFKTSISLALVAVVLSVVLADDSDQKETWYDDYYLGDLDDKEIFQPFTSLHRSTISDGNEDLPAEIKDTDDLSSDINEYNEEKKFEDELKRYVIDRKSDAQDDSSFDDYQFEDNRFGDDNSYGDDQSDEFNPLLNKELQVETDKEKHVEASTELLNDLSKTNKSTDKDILEETKSILSEETINDFAHDKTFGANVVNDTESGHILEETRKIINENYGSIEDNIDKDGGEESIRDSETEEDHRTMITTSEEGGEREEKGDTIDTAYSDIMHDLNRLHGTWKKLNEVGDDVDDNDYSESEEQQEMDGDYEEIGDSELERIFENSSNNEQENVEASDEILSDETSASHANTYKPLPYINDALVASDVDRSLPKSTNDSSDSNNASSTEESSVTESSKASGTTESMSTTEAVETTLKSEMNEMSIAEEDAAILKFTEVNPSILDVTSSDIINATNVWLTVNGSVEVTSPDYPSPYPTNNTVDWMFQGAGQGIELNITEFSVNGYLGDYLLVKPGGVDTSGSSGLIFTYSLRTERRYRFLDVDKMFVRFVAKPGNQLFRGFKFSARMVVDRPESIPEPEEDVPAPVSPATITVNLGGISLQDFHGVEEQFRRIIADMATLYINTNDIDAGLNATNNFAVSAIVTVAMQILFFTIASLSQSDKVTDFTGGANFIIIALLTFFLGQGGNTLKNYDSRQLMVTAFICVWGVRLSGYLIYRIYHIGRDKQFEDRKSNTLRFAVFYTFQAVWVYVVSLPVIIINSPHHSYPKAPKTMTTLDSAGAGVFVIGLLIETYADLQKFAFRQEPANQGRWCNDGLWGLSRHPNYFGEVVLWWGIFIISLNIIEGVEYIAVLSPLFTTAIILFLSGIPLLERSADEKYRDNPDYLYYKASTSPFIPIPPAIYVEVPRFLKCMLCCEFPIYDSTGDEFPAPTIVTETTSISMVQSQT-
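
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, not part of the original record, assuming the standard genetic code and that the trailing "-" in the protein sequence marks the stop codon. The names `CODON_TABLE` and `translate` are illustrative. Only the first and last few codons of DPOGS209317-TA are used, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence, writing stop codons as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = (cds[i:i + 3] for i in range(0, len(cds), 3))
    return "".join(CODON_TABLE[c].replace("*", stop_symbol) for c in codons)


# First ten codons and last five codons of DPOGS209317-TA, copied from above.
print(translate("ATGTTCAAAACTTCTATATCTTTGGCTCTC"))
print(translate("CAGTCGCAGACATAG"))

# The listed lengths fit 940 codons plus one stop codon.
print(2823 == 3 * (940 + 1))
```

Both translated fragments agree with the corresponding ends of DPOGS209317-PA, and the listed lengths (2823 bp, 940 aa) are consistent with a complete coding sequence that includes its stop codon.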