Monarch geneset OGS2.0

DPOGS210730
Transcript: DPOGS210730-TA, 1983 bp
Protein: DPOGS210730-PA, 660 aa
Genomic position: DPSCF300013 + 105454-113503
RNAseq coverage: 2109x (Rank: top 6%)
Annotation
Heliconius: HMEL016169, E-value 1e-96, 71.37%
Bombyx: BGIBMGA006260-TA, E-value 2e-48, 84.91%
Drosophila: CG17838-PB, E-value 2e-109, 63.33%
EBI UniRef50: UniRef50_O60506, E-value 6e-61, 49.79%, Heterogeneous nuclear ribonucleoprotein Q n=114 Tax=Eumetazoa RepID=HNRPQ_HUMAN
NCBI RefSeq: XP_002137140.1, E-value 7e-110, 64.71%, GA27045 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastp: gi|198450702, E-value 1e-108, 64.71%, GA27045 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastx: gi|307173250, E-value 4e-125, 53.75%, Heterogeneous nuclear ribonucleoprotein Q [Camponotus floridanus]
Group
Gene Ontology: GO:0000166, E-value 3.6e-28, nucleotide binding
               GO:0003676, E-value 3e-21, nucleic acid binding
KEGG pathway 
InterPro domain: [277-417] IPR012677, E-value 3.6e-28, Nucleotide-binding, alpha-beta plait
                 [342-407] IPR000504, E-value 3e-21, RNA recognition motif domain
Orthology group: MCL10769 (Multiple-copy universal gene)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210730-TA
ATGAGGCCTCGAGCCAACGAGCGTGACAGGCCGCTGTGTTACCACTTCCTACGTTCCCTATTGGACTCTCGAACCGTGGCGAGGTTGGTGGCGTCCCGCGCCCTTGTCTATAACAGCTTTGTAACCGGCACGATAGCTGACTGCGGTTTGGGGACTCGCCTCCTTGTACATTACCCTCGAGTGAAATCATTGGCGACGTCCGAGGCCTCGCTTAGGCTCGCTACCGCGACGGTTTTGCAGTTCAGGCTCGAGACATGGTCCGCTCCACACGTTCAGTGTCGCGCGCTCGGGGCACTGGTCGCGTTCGCTGTCGTGGATGGTTTCACTCGCAAACCGAAGTCTCCAAACTGCAGCGCGTCTTGCAGGCCTACCGCCCGCCCAGCGACGGTAGTCTCGCAAGACGCGCATGTCATTAATATCCCCATCGACTTCGGCCATAAATATGCGGAGTGTGACGTCGCGACGGCCCGGCGGCCGGCCCCGGGCGCGGCGCCGCGCGTATCTATGGCCGGAGTCCCCACTGTAATCACTGATCTTGTGTACATGTGCAACGTAACACCCCTCGATAATCACGAAATAAAACCGGGGAAGACTCTGAGAATTAAGATTAGCGTACCGAACCTTCGACTTTTCGTCGGCAACATTCCCAAGTCTAAAGGCAAAGAGGAGATACTGGAAGAGTTTGGTAAATTAACAGGGGAAAAAGATTGGCGTTACGCACGTCCGCGCCGCGCCCGCGCCCCGCCCGCGCCCCGCCGACCCGCCGTAGTCGCTAACATGACATTGGCCTCCGCCTCGAGAATGCTCACTGCCGGACTCGTTGAAGTCATTATATATAGTTCGCCCGATGACAAGAAGAAAAATAGAGGATTTTGTTTTTTGGAATACGAGTCTCACAAAGCGGCGTCACTAGCCAAGCGTCGGCTGGGCACCGGCAGGATTAAAGTTTGGGGCTGTGATATTATAGTGGACTGGGCGGACCCGCAGGAGGAACCCGACGAGCAGACCATGAGCAAAGTGAAGGTGTTGTACGTTCGGAACCTGACCCAAGAAATCACAGAAGAAGCGCTTAAAGAAGAATTCGAACGTTATGGAAATGTAGAACGAGTTAAGAAAATTAAGGATTACGCTTTCGTACACTTCGAAGACCGGGATTGTGCCGTTAAGGCGATGCAGGAGATAGACGGCAAGGAGCTGGGTGGAGCCCGCCTCGAGGTGTCGCTGGCCAAGCCACCCTCGGACAAGAAGAAGAAGGAGGAGATACTGAGGGCGAGGGAGAGACGCATGACGCAGATGATATACGGACGGGGCGGATTTGATTGGTGCAGCTGCTCGCCGGTGCACGGGGCGCTCCGGGGCCGCACGCCGCAGCCGCAGCCGCGCCCGCCGCAGGCCCGCGGGGACTACGATTATGATTACGACTATTACGGGTACGGGGATTACCGAGGTGGCTACAATGAGCCATTTTACCGGTACGATGAGTTCTATTTTGATTACGCGGGGCCACCGCAACCGTCCGCCGTCCGCCAGCCTCCCAACAGAGCGCAGCCGCAAAGAATTGTGATTGAGAGTGTGCGAGTAGTATCCACGGCCGGTCGTTCAGAGCGAGCGATGTCCGGGGCCGAGCTGACTCCACTGTGTGTGATGTCCAGGGGGCTGGGTCATGTGGGACGGGCGCGCGCTGGGGGCGGCGCGGGGCCGCGGCTGGGGCCCGTGGTGGTGCGCGCCGCGCCGCTCGTGGCCGCCGCACGCCCAGCGGCATGCGTGGCAACCCGCGCGCCAAGCCAAGTTTACCAGGTACGTACACGGGCACAGGAACAGACACGACGTAAACGTAAACTCGACGGGGGTCAGCAGATCGCTGGGGGGGAGCGGGAGAGCAAGCGGCGACTGGGCGCGGCGGCGGCGGCGGCGCGCGGCTGGGGGTCGGCGGGGGTAGGATCGATGGGGTCCATGGGGTCCGAAGGTGCCGCCGCCAGCTAG

Protein sequence:

>DPOGS210730-PA
MRPRANERDRPLCYHFLRSLLDSRTVARLVASRALVYNSFVTGTIADCGLGTRLLVHYPRVKSLATSEASLRLATATVLQFRLETWSAPHVQCRALGALVAFAVVDGFTRKPKSPNCSASCRPTARPATVVSQDAHVINIPIDFGHKYAECDVATARRPAPGAAPRVSMAGVPTVITDLVYMCNVTPLDNHEIKPGKTLRIKISVPNLRLFVGNIPKSKGKEEILEEFGKLTGEKDWRYARPRRARAPPAPRRPAVVANMTLASASRMLTAGLVEVIIYSSPDDKKKNRGFCFLEYESHKAASLAKRRLGTGRIKVWGCDIIVDWADPQEEPDEQTMSKVKVLYVRNLTQEITEEALKEEFERYGNVERVKKIKDYAFVHFEDRDCAVKAMQEIDGKELGGARLEVSLAKPPSDKKKKEEILRARERRMTQMIYGRGGFDWCSCSPVHGALRGRTPQPQPRPPQARGDYDYDYDYYGYGDYRGGYNEPFYRYDEFYFDYAGPPQPSAVRQPPNRAQPQRIVIESVRVVSTAGRSERAMSGAELTPLCVMSRGLGHVGRARAGGGAGPRLGPVVVRAAPLVAAARPAACVATRAPSQVYQVRTRAQEQTRRKRKLDGGQQIAGGERESKRRLGAAAAAARGWGSAGVGSMGSMGSEGAAAS-
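
Length check:

The transcript and protein lengths reported in this record agree with each other if the transcript is read as a coding sequence with three nucleotides per residue plus one stop codon. That reading is an assumption, not something the record states. It is supported by the nucleotide sequence starting with ATG and ending with TAG, and the trailing "-" in the protein sequence appears to mark that stop. The sketch below shows the arithmetic; the function name `cds_length_matches` is hypothetical and not part of any database tooling.

```python
def cds_length_matches(transcript_bp: int, protein_aa: int, has_stop: bool = True) -> bool:
    """Return True if transcript_bp equals 3 nt per residue, plus one stop codon if has_stop."""
    codons = protein_aa + (1 if has_stop else 0)
    return transcript_bp == 3 * codons


# Lengths copied from the record: transcript 1983 bp, protein 660 aa.
print(cds_length_matches(1983, 660))  # (660 + 1) * 3 = 1983
```

A mismatch on other records could point to untranslated regions included in the transcript or to a partial gene model, so this check is only a quick sanity test.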