Monarch geneset OGS2.0

DPOGS204547
TranscriptDPOGS204547-TA3012 bp
ProteinDPOGS204547-PA1003 aa
Genomic positionDPSCF300297 + 81805-102775
RNAseq coverage108x (Rank: top 60%)
Annotation
HeliconiusHMEL0175940.083.30% 
BombyxBGIBMGA005337-TA0.080.50% 
DrosophilaLiprin-gamma-PC4e-16754.20% 
EBI UniRef50UniRef50_D6WE190.046.08%Putative uncharacterized protein n=3 Tax=Endopterygota RepID=D6WE19_TRICA
NCBI RefSeqXP_969578.20.049.18%PREDICTED: similar to AGAP007137-PB [Tribolium castaneum]
NCBI nr blastpgi|2700049250.046.08%hypothetical protein TcasGA2_TC010503 [Tribolium castaneum]
NCBI nr blastxgi|2700049250.047.44%hypothetical protein TcasGA2_TC010503 [Tribolium castaneum]
Group
Gene OntologyGO:00055152.1e-14protein binding
KEGG pathway 
InterPro domain[668-727] IPR0211291.7e-14Sterile alpha motif, type 1
[661-735] IPR0109932.1e-14Sterile alpha motif homology
[671-732] IPR0137613.7e-12Sterile alpha motif-type
[751-819] IPR0115102.7e-09Sterile alpha motif, type 2
Orthology groupMCL11899 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204547-TA
ATGGCGCTTGTGCGCCGTATATTAGGCGACGCCCAGGCGAAACTTCGCAAGATGGTGGACGAGCAAGTGTCAGTTGGCACAAGGGTAGAAGCGGATGCAGAACCAGAGCCCGCAGACCCTCTTATCACTCCAGCATGTTCTGCACTCGATCGAACTGACGATCCTCGACCAATTCCACCCGAACGAAGATTCCTCATCAGCACTCTTGATAAACCTCGCAATAGTTCGCTGAATGGGGACAAAGATTCTACATTGAATCGGTCTGCTCGAATATCAATAAAAAATGGAAAGGAAAATCCTTTTATCGCCTCTTCACCAGATGACGATAACAAAGAAAATCAAACAAATGGTAAACTAGATTCACCGATCAAATGGTCCACTCAAAATGGTAGCGATAGTGGAACTTATGCTGAGATTGGGACAGGAGCAAACAGTAACACCAGTGAGAAAAATATTCTGGATCCAGCGGACAGCTCTGAAGAAGAAGTTCCTCTACCAGATGCTACGTCTCCGGGAAGGAGCAGTGAAGGCCCTGAGCCCAGAGCTGATATTACTGCATCTCTTGAGGGAGGTTCAGGGTCCCCAGGAGGTCGGTCGGATCGTTCCCGTCAGGACGAAGTACCGGCATCTCCAGGAATGTTAACAGCTGCACAGTTGGCGCGCCGACTACGTTTAGAAAATGAGCGCCTGCAGGTAATTTTGTTTATGATATTGGCACATCTGGTATTCCTTAACCGCCAAATATGTACCAGTGTCTATATACAGCTGTCGATAGAGCCTCTAAAAGCCGACAAGAAACGGCTCAAAGCGGAGAAATTTGATTTGCTGAATCAAATGAAGCAACTCTACGCCACTCTTGAGGACAAAGAGAAGGAGCTTAGGGATTTCATCAGGAATTATGAACAGATGCGGTCTCGAAGCGGTGCGTCCTCAGCGCTGGGTGCAGAACGAGCTGAACGCGAACGCGAGCGTGCGGCATTGTTGCGGCATGCCCGCGACGAGGCCGAGCGCTCTTTACAACTGGCGGCCGCACTCAGCGCCCGTGATACGCAGTTGCGGCATGCTAGGGAACAGCTTTTTGAGGCTCGAAGACAACTACAAGCAGCAGGGTGTTTGTCCGAAGGTGAGAGTGTAGCGTCTTTGGGAATTGGTCCTCCAATGATGCTTGGAGGTCCCACGGGTTTGATGGGTGATAGAGGTAGCTGCAGCGCAGATTCTGGAGTTAGAGGTAGTAGCGATGGTGGCGCCACGTCGGTTTGCGGCGGAAACCTATCAGACTCCACCGCAGAGGGCGCGCCTCCCACCCTCGACCCATACGATACAGATGCTGTATCGCTGGTGTCATCCGCGCACCCTATATACCAATTAAGCACGCCCCGTGACTGTAGCCCGACTCTCTCACCACATAACAGCGGTTCATCATTCACAAGATCTATTGATGCTGGATCACTATCTAGGTCAGTTGAGCAGTTATCGAGTCCGGGGGAATGTGACTCTGGTATGGTTGGGATGCGGACCCGTCCTGGGGGTTCCAAGGCCGGCCGCGGTCGGGGATCCGCTTGGGGATCCATATCTCGCGTTTTTGCTAGAAGCAGACACCGCACCAAGTCCGGAAGCGCAGCCAGTAGTGGTCACGAGAGCGAGCCAATATACGCTGGCACAGGCAGCACAAGTCGCGCTTGGTCTCCTTTAGGGAGTGAGGCGGCATTACGCGAAGCTGCCTCTCTACCTCTATCAAGATGGCGGGCACCAGCCATCATCGCCTGGCTTGAACTTGCTCTTGGCATGCCGCAATATGCAGCTGCTGTTGCTGATAATGTTAAAAGTGGAAAGATCCGTGCTTTGAATGGGCAGGTTCTGCTCGAGTTGACGGACACGGATCTTGAGGTCGGGTTGGGGGTGACTCAGCCAATGCACAGGAAGAAGCTGCGGCTGGCCATCGAAGAGAGACGGCGGCCGGACCTCGTACGGAACCCTAGCATCGGACAGCTGAGTCATGCATGGGTTGCGGCGGAGTGGTTGCCAGATCTAGGGCTATCCCAGTACGCAGAATCATTTTTAGCCAATTTGGTGGATGCTAGAATGCTGGATACTATCAGCAAGAAGGAGCTGGAGAAATATCTTGGTGTTACGAGAAAGTTCCATCAGGCATCCATTGTCCACGGCATTCATTTGCTACGAATCATGAAATATGATAGACAAGCACTGGCAGTACGGCGGCATCAGTGCGAAAATGTCGATGCGGACCCTCTGGTTTGGACCAATCAAAGGTTTATGCGTTGGTCTCACAATATCGACTTGGGTGAATTTGCTGAGAATCTTAAAGACAGCGGTGTCCATGGTGGTTTGGTGGTACTGGAACCATCATTCACTGGTGAGACCATGGCCACGGCGCTTGGTATACCACCGTCGAAGAGTATAATTCGAAGGCATTTGGTAGCTGAATTTGATGCCCTTGTCATCCCAGCGAGGAATATGTTTGGTCACCAAATAAGGATGTTGGGAAGACCGTTTTCAAGATCGGTTGCGACAGGCTTGCCTGGAATTGACTTTAGCGCTGATTCTAGACGACATAGTCTAAGGGGCTCTATAACACGAGCGTTGGGTGTTCTCAAGCCGAAGCACGATAGACCATCACCATCTAGTTCAAGCGAGAGTTCTAGCGTGATGAGTCTGACACAACCGTACATATCTTATTCACCTCCTATAGCAGTGCGGACGCTGTCTCAATTGAGCATGACATACGCTCCACCACCGACACTGGCAGAGTATGAACCGATATATACGCCTTTGAGTTTATATTCCCAGTCTAGCGTATCCACAAAGGATAGCCTTCAGCGCCTTAATGATGGCAAAGATTATAATATCACCCACAGGTACGGACAAAAAGTAGATCAATCTCATCGAGTCAGTTCACCGTTACCTGAAACATCTGACGGAAATAAGCAAAGACGTCACAGACGAGTGAAAAGTATAGGAGATATTAATGCTTCGAGCAAAACGACGGTTTAA

Protein sequence:

>DPOGS204547-PA
MALVRRILGDAQAKLRKMVDEQVSVGTRVEADAEPEPADPLITPACSALDRTDDPRPIPPERRFLISTLDKPRNSSLNGDKDSTLNRSARISIKNGKENPFIASSPDDDNKENQTNGKLDSPIKWSTQNGSDSGTYAEIGTGANSNTSEKNILDPADSSEEEVPLPDATSPGRSSEGPEPRADITASLEGGSGSPGGRSDRSRQDEVPASPGMLTAAQLARRLRLENERLQVILFMILAHLVFLNRQICTSVYIQLSIEPLKADKKRLKAEKFDLLNQMKQLYATLEDKEKELRDFIRNYEQMRSRSGASSALGAERAERERERAALLRHARDEAERSLQLAAALSARDTQLRHAREQLFEARRQLQAAGCLSEGESVASLGIGPPMMLGGPTGLMGDRGSCSADSGVRGSSDGGATSVCGGNLSDSTAEGAPPTLDPYDTDAVSLVSSAHPIYQLSTPRDCSPTLSPHNSGSSFTRSIDAGSLSRSVEQLSSPGECDSGMVGMRTRPGGSKAGRGRGSAWGSISRVFARSRHRTKSGSAASSGHESEPIYAGTGSTSRAWSPLGSEAALREAASLPLSRWRAPAIIAWLELALGMPQYAAAVADNVKSGKIRALNGQVLLELTDTDLEVGLGVTQPMHRKKLRLAIEERRRPDLVRNPSIGQLSHAWVAAEWLPDLGLSQYAESFLANLVDARMLDTISKKELEKYLGVTRKFHQASIVHGIHLLRIMKYDRQALAVRRHQCENVDADPLVWTNQRFMRWSHNIDLGEFAENLKDSGVHGGLVVLEPSFTGETMATALGIPPSKSIIRRHLVAEFDALVIPARNMFGHQIRMLGRPFSRSVATGLPGIDFSADSRRHSLRGSITRALGVLKPKHDRPSPSSSSESSSVMSLTQPYISYSPPIAVRTLSQLSMTYAPPPTLAEYEPIYTPLSLYSQSSVSTKDSLQRLNDGKDYNITHRYGQKVDQSHRVSSPLPETSDGNKQRRHRRVKSIGDINASSKTTV-