Monarch geneset OGS2.0

DPOGS214304
TranscriptDPOGS214304-TA5235 bp
ProteinDPOGS214304-PA1744 aa
Genomic positionDPSCF300020 - 1080181-1094119
RNAseq coverage264x (Rank: top 40%)
Annotation
HeliconiusHMEL0034590.094.59% 
BombyxBGIBMGA004109-TA0.091.81% 
Drosophilanito-PB0.067.23% 
EBI UniRef50UniRef50_G6DTG30.0100.00%RNA recognition motif protein split ends n=5 Tax=Pancrustacea RepID=G6DTG3_DANPL
NCBI RefSeqXP_001970541.10.067.61%GG23320 [Drosophila erecta]
NCBI nr blastpgi|3123709260.066.44%hypothetical protein AND_22865 [Anopheles darlingi]
NCBI nr blastxgi|1571316680.077.65%RNA recognition motif protein split ends [Aedes aegypti]
Group
Gene OntologyGO:00054881.4e-60binding
GO:00168491.4e-20phosphorus-oxygen lyase activity
GO:00091901.4e-20cyclic nucleotide biosynthetic process
GO:00355561.4e-20intracellular signal transduction
GO:00036761.4e-14nucleic acid binding
GO:00001664e-13nucleotide binding
KEGG pathway 
InterPro domain[347-515] IPR0161941.4e-60Spen Paralogue and Orthologue SPOC, C-terminal-like
[360-482] IPR0129215.6e-27Spen paralogue and orthologue SPOC, C-terminal
[589-765] IPR0010541.4e-20Adenylyl cyclase class-3/4/guanylyl cyclase
[70-142] IPR0005041.4e-14RNA recognition motif domain
[57-145] IPR0126774e-13Nucleotide-binding, alpha-beta plait
Orthology groupMCL11628 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214304-TA
ATGCATGAGTATCCCATGGCAGGTCCTCACGGGCCTCCAATGCACCATCGCCCACCCATGCATCATCCACCTCATCCTCATTACATGCCACGCCCTTACATGCCGCGTCCTCATCACCCACCATTTGAAAAAATGGAAAACAAAAAAGACAAGTTCCCTAATTACTTACATCATGTTCAACCAGAAGATGATCCTCTTGCAACAAGAACTTTGTTTGCTGGGAACTTGGAAATAAATATATCAGATGAGGAATTAAGAAGAATCTTTGGTCGTTATGGGATTGTTGAAGATATTGACATTAAAAGGCCTCCTCCAGGCACTGGAAATGCATTTGCATTTGTTCGCTATCAAACATTAGACATGGCTCACCGAGCCAAAGTAGAGCTATCTGGCCAGTATATTGGTAAATTTCAATGTAAAATTGGATATGGCAAAGCTACACCGACTACTCGTGTTTGGGTTGGTGGGCTAGGTCCATGGACATCAGTAGCCCAATTAGAAAGAGAATTTGACAGATTTGGGGCCATAAAGAAAATTGAATATGCTAAAGGTGAACCTCATGCATATATACTGTATGATTCGATAGATGCAGCTCAAGCTGCTGTAAAAGAAATGAGAGGCTTTCCATTAGGTGGACCAGACAGGCGCCTTAGGATTGATTTTGCAGATGTCGGCACTGGGGGACCATACAGACCGAAACCATATGCAGCACCCGTTGAAGAAGGTCGTTCTGAAGGTTATGAAGGATATGAAGGTTCTTGGGAGGATGGTTATAGTTATGGTTCTGGTTATAGAGGTAGGGGCGGCCACCGTGGGCGAGGTCGTGGTATGTATCGTGGAGTGTATCACGGCAGCGCTGATTATAGGGATGAGGAATGGAGGAGAGCACCAGATGCTGAATATGACAGTAGAGCTCGTCGTTCTGGTTCCCGAGAACCTGGCGTTGACAGATCACGTTCCCGTTCTCCACGTCGTCGTTCTCCCGACAGTGATTCTGATGGATCTCCCCGACGTAGCAGTGGCATGCTTGCCTCAGCTAGAACACTCCCTGAGGTTGTTCGTAAAGCTACAACAATCTGGAATGGTGCCCTCATACTCAAGAATTCCTTGTTTCCAACTAAATTCCACCTTACAGATGGAGATTCAGACATAATTGACAGTTTAATGAAAGATGAGGAAGGTAAAAATCAATTGAGGATTACACAAAGGCTTCGTCTGGATCAGCCAAAGTTAGATGATGTACAAAAACGTATTGCTACTTCTAGTTCACACGCTATCTTCCTTGGTGTGGCAGGATCAACGGCTTCCATTACAAATGAAGATGCAAGCATACAGACAAGGCCTATGAGGAATTTAGTTTCCTATTTGAAACAAAAAGAGGCTGCTGGAGTTATATCATTGTTGAATAAAGAAACTGAAGCCACTGGGGTTTTGTACTCTTTCCCTCCCTGTGACTTCTCCACGGAACTGCTCAAGAGAACTTGTCACAACCTGACTGAGGAGAGTTTGAAGGAGGATCATTTAGTTATAGTGGTAGTAAGGGGCGAGGAGGATGCTAAAAAATATGAAGCCTATTTGGCTGCTCTTAAGCAACTTCGAAAGTCTTCGTTCGTAGAGGTTACTCCGAGAGAAAAACACAGAAATCAGGAATCCAAGAAACAAGCGTTGCTAGATGGTATATCAGGTACTTGGAGTACTGTCCAACAGGCGACAAGAATTGCGATCATAGCTTCCCTGGTGCCGGATGAGATTATTTACAGACATTCAGACCATTCCGTTAGAAGTTATGAGACCGCGCTCATGTTTATAGATGTCTCTGGTTTTACCAAGCTATGTGAGACTTATACGAAAACCGGTGGTGGCCCTTCGAGGCTTACCCAAGTTCTTAATTCTTACATTGGTGCTATGGTTCAAGAAATTTTAACGCATAAGGGGGATGTTTTAAAGTTTTCTGGCGACGCCTTTTTATCAATGTGGAAGAAATCTCCCCGATTAAACATGCAAGATGTCGTTCACACCGCTATTGACTGTGGTTTGTTAATTCAAAAAAATTACGGAAGATACATGACTGACGTTGGAGTGGTTCTAAAAGTCAAAGTCGCTATATCCGCCGGTTTGTCCCATTTTTCTATAATCGGTGGTGGTAATATATCCCAAACGCAATACGTAATAGTCGGTCAGCCGGTGTGGGACGTCAAAATGGCGGAATATATGAGTGCAGCTGGTGACGTTTTAACGTCAGCCAGTGCTTGGATGTATGTCAATGAGGCGGAATATTGTACACAGCCATGCGGAGATGGTAGACATACTAAGGTATTGGGTGTTGGCGCTTCTTGGAAAAGAGTAGAGAAACTGCGTTTTTCTCTAGGAATGAATAAAGAACCAGACTGCTTTAGTAACGAAAATTTATCACTTGAAAATTTTACTGTTTCCGGCATTAATTATCGAGAGTATGCACATCGTCCAGCTGTGGTAGCAGCGATGCGTGGTACTTGGTGGCCGGCTCTACGTCAATTCATGGTACCGCCAATATTACGAGCGGTTGACAACGACGAACCTATGGACTTTCTCACCGAAGTCCGCCATGTTGTTGTTGTCTGTATAAATATAATAACAAGAACTGTCACAGAGACTGTACTCATTGAGGTTGTTGATACCGCTTACAAATGCGTCTACAGCGTGACGTCAGAGGCTGGCGGTCTCGTCAACAAAATCTCAATGTTCGACAAGGACATGATGTTGCTTGTAGTTTTCGGCTTAAGAGGACTCAAGCATGAGGACGAAGCCCAAAAGGCACTTCAATGTGCGTCTCAGTTAAAGGAATCCCTTGATGATGTTAATATTATAAACGTTAGCATTGCAGTTACCTCGGGACTAACATATTGCGGCGTTGTCGGTCATGTACTGAGAAGAGAATATACTGTCATAGGATCAGCTGTCAACAAGGCTGCTCGTTTAATGATGGCGTATCCGAATAAAGTGACCTGTGATAAAGAAACTTTTTTAAAGAGCAAAATAAATCAGGAGTGTTTTAAATTGGTGGAGACCAAACCTTTAAAGGGAATATGTAAACCTGGTCCAATATACGAATTCAGTAATCCTAGAAAGACGGAAAGAATTACATACTGCCGTCATCCGATTCTTGGTCGTAACGAGGAACTGCGAAAATACAAGATGACCTTACACAATGCGTTGGACGAACATCCGAAAAGCTTCACCAGATATAGAGACCATAAATTCGGCGTAGCGTTTATTGGACCAAAATTGGTTGGAAAGACACGTCTCATGCAAGAGTGTATAAACATTACTCCGTCGTTTGTTTTGGTTGATCATTTTGTTCTAACAGAGAAAGACAAGCTAAAGTTTGGAATAATACGATTAATAATGAAATCGATTTTCAAATGCGGTGGGAAATTGTTGAGAGAAAATCGCGAGAATAGAATATTGACATCTATTGACATGACGTCATTAGGGCCTCTGGAGATATACGGTATAAACACCGTGTTCGACTGTCGCTTCCCGTTACCCGAAAATTACGCTCCAACGTGCAAATTACTCGATCAATTTAAAGTCAAGGAAGTCATTAAGGAAATATGTAGGGTGAATCTGCCGTCTCTACGCGTTGTAGCGGTCGCTGAAGGTCAATATATCGATGATGATTCCTGGCAAATTATAATTCTTCTTTTGGGGGCTAAACTAATTTTTCTACTAGTCAGCATATCAGAAGAAGAAACACTCTCTGCCACAGCTACAATATGCTTAGCTAACGCTATGATAATCAAACTGCCGCTATCGGGAATCGATCGGTGGTACCACGCGGCGTTGGCCTGCCAGCTTCTGGACGTACAGGCGATACAGTCGGATCTGGAAAAGATCATTGAAAGTGCAAGTGAAGGTTTGCCGGGGTGGATTCAGAACTTTGTCATATCATTAGTTCAGAGAGGTCAATTAACAATGATGACCATGTCTCGATCAGAGGCGCTGGAGATGGGAGCCGTGACACCGTCACCAGCGTTACTTGAGACGGACACCACTAGTACGTCGTTTGAAGATATCGAATGCAGCAAGGATAGCTACTCTTACGTACTTAAACAAGGCTCGGTGGCAGAAAACGAGATGATACAAATGGCGGTGCTGACTGACACGTACGACTTCGAGAACATGAAAGTTGACGTGAAAATGGACGCGCTTATTTTGAAGACATACGATTCCTTAACGCCTTTCGAGAAAATGCTATTGAAATGTGGCTCAGTGCTGGGCGAAGTGTTCTCGCGCTGCATGCTTTTACACTTGCTGCAGAGCGATTCCCCTCGGAGAGTAGCTCAAGCTCATTGTCAAGATCTTCCATCGTACGCGTTCTGTGGGTACATGAAATTTAGACACAACATGTTTAGGACAACCACGTATGAATTGTTGACTGAGAGTCAGAAGGGGTTAATACATGAGAGCAAGGAACTAAACCAGATCCGCGAGCAGATTTGTGCATTGAGCACCGAGACTAAGATGACGAGTGATAATAGCGCGGTTGATGCATTTTCCCAGTACCAGATGTCAATCAGAAGCGAATCAAATATTCGTGCGTTACTAGATTCTGAGGATTTAAGGCGCTTGAGTCGCTCGATGCAAATGTATCGTAAAGATAAGCGTATAAGATCCTTCTCGTCTCTTGAGCTGAGTATTTGTGAATGTTTGCCGATACTTCTCTCGGCTTATTCACAGGCTATAGAACATTGTCACGGCGCAGATGATTCTGAAAAATTATTCGAAGCCTATTTAGAGTACGCCGACTTGAGCATAATCAACATGAACATACCGCAGGCTGTTCACTTACTCTCTAAAGTAGAGGAGTTCGTTTTGAGTGATGCGAGTTCTAAGAAAAACGAGTTCAAATGGGTCAAAGATTTCAAACTGGGTCGCATACATTCGTTGCGCGGCGCTTGTTTGCTCGAGTGTGGTGACTTAGATCAAGCGAGGAAGGAATTGTTACAGGCTATGCGGCTGTTCTGTGATCCCTTCCCGAGTTCCAAAAACGCGGTGCGGTTCAGAAATTTGAGGGCCTCGTTCAGTCAGATAATGGCACTGTTCATAGTACCTCAGATGTATGTGGCGACTACCAGCGGTTTTGTTGGGGATTTTTACGAAGCTATCGCCTGGACGCTCAACAGGTTGTACAGGTTATTCAATGTAAGTGATGTACAGCATATTCTGCGTATTAATTTAAGGAATAAACGTTAA

Protein sequence:

>DPOGS214304-PA
MHEYPMAGPHGPPMHHRPPMHHPPHPHYMPRPYMPRPHHPPFEKMENKKDKFPNYLHHVQPEDDPLATRTLFAGNLEINISDEELRRIFGRYGIVEDIDIKRPPPGTGNAFAFVRYQTLDMAHRAKVELSGQYIGKFQCKIGYGKATPTTRVWVGGLGPWTSVAQLEREFDRFGAIKKIEYAKGEPHAYILYDSIDAAQAAVKEMRGFPLGGPDRRLRIDFADVGTGGPYRPKPYAAPVEEGRSEGYEGYEGSWEDGYSYGSGYRGRGGHRGRGRGMYRGVYHGSADYRDEEWRRAPDAEYDSRARRSGSREPGVDRSRSRSPRRRSPDSDSDGSPRRSSGMLASARTLPEVVRKATTIWNGALILKNSLFPTKFHLTDGDSDIIDSLMKDEEGKNQLRITQRLRLDQPKLDDVQKRIATSSSHAIFLGVAGSTASITNEDASIQTRPMRNLVSYLKQKEAAGVISLLNKETEATGVLYSFPPCDFSTELLKRTCHNLTEESLKEDHLVIVVVRGEEDAKKYEAYLAALKQLRKSSFVEVTPREKHRNQESKKQALLDGISGTWSTVQQATRIAIIASLVPDEIIYRHSDHSVRSYETALMFIDVSGFTKLCETYTKTGGGPSRLTQVLNSYIGAMVQEILTHKGDVLKFSGDAFLSMWKKSPRLNMQDVVHTAIDCGLLIQKNYGRYMTDVGVVLKVKVAISAGLSHFSIIGGGNISQTQYVIVGQPVWDVKMAEYMSAAGDVLTSASAWMYVNEAEYCTQPCGDGRHTKVLGVGASWKRVEKLRFSLGMNKEPDCFSNENLSLENFTVSGINYREYAHRPAVVAAMRGTWWPALRQFMVPPILRAVDNDEPMDFLTEVRHVVVVCINIITRTVTETVLIEVVDTAYKCVYSVTSEAGGLVNKISMFDKDMMLLVVFGLRGLKHEDEAQKALQCASQLKESLDDVNIINVSIAVTSGLTYCGVVGHVLRREYTVIGSAVNKAARLMMAYPNKVTCDKETFLKSKINQECFKLVETKPLKGICKPGPIYEFSNPRKTERITYCRHPILGRNEELRKYKMTLHNALDEHPKSFTRYRDHKFGVAFIGPKLVGKTRLMQECINITPSFVLVDHFVLTEKDKLKFGIIRLIMKSIFKCGGKLLRENRENRILTSIDMTSLGPLEIYGINTVFDCRFPLPENYAPTCKLLDQFKVKEVIKEICRVNLPSLRVVAVAEGQYIDDDSWQIIILLLGAKLIFLLVSISEEETLSATATICLANAMIIKLPLSGIDRWYHAALACQLLDVQAIQSDLEKIIESASEGLPGWIQNFVISLVQRGQLTMMTMSRSEALEMGAVTPSPALLETDTTSTSFEDIECSKDSYSYVLKQGSVAENEMIQMAVLTDTYDFENMKVDVKMDALILKTYDSLTPFEKMLLKCGSVLGEVFSRCMLLHLLQSDSPRRVAQAHCQDLPSYAFCGYMKFRHNMFRTTTYELLTESQKGLIHESKELNQIREQICALSTETKMTSDNSAVDAFSQYQMSIRSESNIRALLDSEDLRRLSRSMQMYRKDKRIRSFSSLELSICECLPILLSAYSQAIEHCHGADDSEKLFEAYLEYADLSIINMNIPQAVHLLSKVEEFVLSDASSKKNEFKWVKDFKLGRIHSLRGACLLECGDLDQARKELLQAMRLFCDPFPSSKNAVRFRNLRASFSQIMALFIVPQMYVATTSGFVGDFYEAIAWTLNRLYRLFNVSDVQHILRINLRNKR-