Monarch geneset OGS2.0

DPOGS211150
TranscriptDPOGS211150-TA3063 bp
ProteinDPOGS211150-PA1020 aa
Genomic positionDPSCF300007 - 38918-43905
RNAseq coverage448x (Rank: top 27%)
Annotation
HeliconiusHMEL0171940.081.68% 
BombyxBGIBMGA003023-TA0.069.74% 
DrosophilaCG7358-PA2e-2934.88% 
EBI UniRef50UniRef50_UPI00021A79351e-3451.23%UPI00021A7935 related cluster n=3 Tax=unknown RepID=UPI00021A7935
NCBI RefSeqXP_971684.19e-3355.78%PREDICTED: similar to CG7358 CG7358-PA [Tribolium castaneum]
NCBI nr blastpgi|3287855011e-3451.85%PREDICTED: hypothetical protein LOC552765 [Apis mellifera]
NCBI nr blastxgi|910821955e-12835.00%PREDICTED: similar to CG7358 CG7358-PA [Tribolium castaneum]
Group
Gene OntologyGO:00082703.6e-05zinc ion binding
GO:00036763.6e-05nucleic acid binding
KEGG pathway 
Orthology groupMCL26740 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211150-TA
ATGTCAAAACAAGGAAAAAGAAAAATTACTGTCGAGTTACCGAAAACTAAAGATCTCTCAAATCGACCAAGTGTTTTTGAGCGTCTCGGTACTAAAAAATCTTCTGGTACTAAAAAGGCTGCGGAATATTGCAGGCAGTGGGCCCAACACGGTACTTGTGCTTACGGTAAAAATTGTAAGTTTGCTACAACCCACACTCTAATAAGCCCATCTAAGCAAAGGGCAGCTAATAAAGATAGTGAAAGTAAACGATTGGTTAAAGATGATGCTAAGAACCGATTGCATTCAACAGTAGTCGTTCGGTCAGGTCGTAGTCCTGATGGCGAGATCGATAATTGGGATCAAAATGACTTAGAATATGCAGATACAGACGTTCTTGAGAAGAGAAGACAGCAGTTGCAGCGTGAATTAGAGTTGCAGTTAAAAATGGATTCCAACAAGGATGTACGTAAAGACAAGAAGAAAGCAGTGTCAAGTTCTTCTTCATCAAGAAGTTCCAGCTCTTCTAGCGTTTCTTCATCTTCTTCAGATGGCAGCTCATCATCGTCGTCATCTGGCAGAGATGCTAAGAGAAGAAGTAAAAAAGGCAAACTAAAAAGAAACTCGAGTTCATCATCTGAATGTGATATTCCTACAAAAAAAAAGTTAGTCAAAGCTAATACTTTAAAAAGAGACAAAAGTGAAACCAAAAAGATTACTGTGAAAAAGGATGACAGGTTATTAAAGAAACAAGATACCAATAAAAAGAAGAATCTTTCTAAGGCAATTGGCAAGAAATCATTGTCCCCTGGTAAAAAATCTGGCGGAACACCACCAATAACTTCAAAGAGTCTAACATCTAAAAGCACTTCTTCAAAACATGTTAAATCCCCACAAAGAAAAGAGAGGGTTAAAAGTGTATCTCCTCATTCAGCTCGAGATAGAGATCGAGATAGGGATCGCGCTAAGGATAAAGAGAAAGAAAAAGAAAGAGAAAAAGATAAAGACCGTGAAAAAGAAAGAGATCGCGTTAGAGGGCGGAGTCGATCGCCCAAAAAGGCTCGATCTAGGTCCCCTCGTAGACATGGTTCACACCCAAAAGACCCAAGAAGAAAAGAAAGTGCGGAAAGAAGTAGACCAGTTTCACCGAAAAAGCAGTTGCGTCGAGATGGAAGTCCTGAACGGCATAGAAAACGTTCAACGAGTAGAGATCGAGATAGAGGCCGTGATCATAAAGACAGATCTCTGGAAAGAAAGAAAGATAAACACGATGATAAAGATCGTCACGATAGGAAGGATAGAGGACGGGAGAGAAATAAGGATAGTAAAAGGGACGATCCCAAGAGGAGAGGCCGTGATCGGGGTCTGGGAGGTAATAGAGGTGACAAAGGTTTGGAAAAACCTAATAAGCCAATGCAAAGATTATTACCACGTCCCGAAGAACGTTTGGCTGCTCTTGCAGCCATTTCGAACCGGGCCTTAGAAACTGATAAAAATTCAACTAATTCAAAAGATCGTCAGGATACACATTCCGAGAGGGTTGGACGGAAGCAAGACAGACTAGAAAGAGACAGATCTAATAAGAGAGAACGACTTGACAGTATAGATAGAGAACAGGGTGATTATGAATTAGGAATGGATCATCAATATGAACATGGACACGGAATTGAGCATTACGACAGGATGGAAGATCGGTATGAAGGTGTTACTAGAGAAGGCGATAGGTCTCCAGGCTACATACAGGGCAGGGACAGGCACTATGATCCCAACTACGACATGCCGGGCCCAGCTAGAGGTTATGCTGAAGACGACGAGAGAATGTACGGCGAATCAATGGGAGATCCTCGTGGAGTCGATATGGGTTACGATGACCGCAGACCTCATCGTGATCGCTCGTGGGAAGGACGCTCGGGTTCTTTGGACCGCGAGCGGGGTTATATCCACTCTCACAAAGATTGGGATAACGATGATTACCGCGGTCCTGGCGATTGGGGGCGAGATCGCAACTGGCAAATGCATGAAGGCCAGATGAACGAGTGGGGTGACAAAGAGCAGGATATGGAAGGTTGGCATCACCGTGGTCATGGGCCAGGACCACACCGAGGACGTATGCATGATGATTATAATCGCGGGGGGAGAATGGATCATGGCAGAAATGACGGTAATGTGAGGCGAGCTCAAAGAACCGAAACGGACACCTCTAAGGAGACGCCAACAGCAGCTCCAACAACTGCAACAGTACAGTCCGAGGAAGTTGAAAACGAAGATAAACCTATTCCTGACGACCTCAGTGAGATCAGTGATGATCCTGATGATATTCTAGAAAGAGAAGATATGCTGGATCAAAATATTGATGAAAATAGTCAAACTGATCAAGCAATGGAAGAACCGCCGACGAAGCCAGATGGATCAGAGAAGGAATCAATCCAGGATACAAGTGGGATTGGCGAAGAAAAAGAATCACATGAAGTCAAGGATGAGGAAGATGTTACTAACTTGGACTTTGAAGAAATATCTGACGGAGAGCTGGAGGAGGAACCTGGATTGGGAGATGCTTTGGGTGTGGACTGGGCAAGTCTTGTAGCAGACGTACGTCGACGCGAACAAACGGCACCTACAGGTGGCACTCGTGACAGGTGGCGTCCTGAACGTGTGCTTTCAAGATTAGGACTCTCAGTTGACATGGCTGGGAGAGAAGCTGTGCAAAAAATACTTCAAGAGAATAAAGAATCCCTAGCATCAGAAACAAAAAGTACAGAAAATGGTGTTGATGATTCGAATGTTGTGAATGGAAATCATGATGAAAACGAAACAAAGGCCAACACTCCCATGAATGTGGATGATTTGCATCCGGTGGCTGTAATTCAGGTGGCAATGGAGAAAAGGAAACGTCAGCGCGCAGTATTATTTGGCTCTGGTTGCGAGATAACCAGGGGTCTAAGCGCGAGGCGTGATTTGGCATTACGGAGGTACCTCTGCCATTTACCAGTGGCGGAACGGACAGGATCATCACGGGTCACTCCGCAACCTACACTGTTCCATGCAGCTCGAGCTTTGCTTATGCAAGCTCCTGTCACTGGCTGA

Protein sequence:

>DPOGS211150-PA
MSKQGKRKITVELPKTKDLSNRPSVFERLGTKKSSGTKKAAEYCRQWAQHGTCAYGKNCKFATTHTLISPSKQRAANKDSESKRLVKDDAKNRLHSTVVVRSGRSPDGEIDNWDQNDLEYADTDVLEKRRQQLQRELELQLKMDSNKDVRKDKKKAVSSSSSSRSSSSSSVSSSSSDGSSSSSSSGRDAKRRSKKGKLKRNSSSSSECDIPTKKKLVKANTLKRDKSETKKITVKKDDRLLKKQDTNKKKNLSKAIGKKSLSPGKKSGGTPPITSKSLTSKSTSSKHVKSPQRKERVKSVSPHSARDRDRDRDRAKDKEKEKEREKDKDREKERDRVRGRSRSPKKARSRSPRRHGSHPKDPRRKESAERSRPVSPKKQLRRDGSPERHRKRSTSRDRDRGRDHKDRSLERKKDKHDDKDRHDRKDRGRERNKDSKRDDPKRRGRDRGLGGNRGDKGLEKPNKPMQRLLPRPEERLAALAAISNRALETDKNSTNSKDRQDTHSERVGRKQDRLERDRSNKRERLDSIDREQGDYELGMDHQYEHGHGIEHYDRMEDRYEGVTREGDRSPGYIQGRDRHYDPNYDMPGPARGYAEDDERMYGESMGDPRGVDMGYDDRRPHRDRSWEGRSGSLDRERGYIHSHKDWDNDDYRGPGDWGRDRNWQMHEGQMNEWGDKEQDMEGWHHRGHGPGPHRGRMHDDYNRGGRMDHGRNDGNVRRAQRTETDTSKETPTAAPTTATVQSEEVENEDKPIPDDLSEISDDPDDILEREDMLDQNIDENSQTDQAMEEPPTKPDGSEKESIQDTSGIGEEKESHEVKDEEDVTNLDFEEISDGELEEEPGLGDALGVDWASLVADVRRREQTAPTGGTRDRWRPERVLSRLGLSVDMAGREAVQKILQENKESLASETKSTENGVDDSNVVNGNHDENETKANTPMNVDDLHPVAVIQVAMEKRKRQRAVLFGSGCEITRGLSARRDLALRRYLCHLPVAERTGSSRVTPQPTLFHAARALLMQAPVTG-