Monarch geneset OGS2.0

DPOGS210367
TranscriptDPOGS210367-TA2178 bp
ProteinDPOGS210367-PA725 aa
Genomic positionDPSCF300025 + 490474-494390
RNAseq coverage438x (Rank: top 28%)
Annotation
HeliconiusHMEL0138383e-16154.74% 
BombyxBGIBMGA011924-TA3e-15248.62% 
Drosophilamars-PA6e-2236.81% 
EBI UniRef50UniRef50_UPI000206407B7e-3127.74%UPI000206407B related cluster n=1 Tax=unknown RepID=UPI000206407B
NCBI RefSeqXP_001961090.16e-2336.17%GF13697 [Drosophila ananassae]
NCBI nr blastpgi|3838554007e-3526.42%PREDICTED: uncharacterized protein LOC100880148 [Megachile rotundata]
NCBI nr blastxgi|3838554001e-4226.00%PREDICTED: uncharacterized protein LOC100880148 [Megachile rotundata]
Group
Gene OntologyGO:00072671.7e-43cell-cell signaling
KEGG pathway 
InterPro domain[155-497] IPR0050261.7e-43Guanylate-kinase-associated protein
Orthology groupMCL25902 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210367-TA
ATGCATGCGGTCAAGAAACGGTTGGAGACCAGCATTAGGAATAGAAGGACTACTCGGTTTTCTGTATTTGACAAAATAAGGAATCTTCCAAGATGTGAGAGCCCAGCTCCAATACCTGAAGTACAGAAAAAAGTGGAACACAGGAGAATGCAGCTAGAAAAATGGAAGGAAGAGAAAGAAAAGAAAAAGAAGGAGGCGTCTTTACAGAAGAAGAAACCATTCATAGCTGGAGTACCACACAATCCATTGAAGTTTGTTCCTCCTCCACCACCCAAACCTATGCCTAGCACATCAGGGAGAGTTACTCGATCACAGTCCTCAAAGAACAACTCGGTTAACAATGTTAAGAAACCCAACAAGACTGAGGGGAAGTCAAAAATCATTGTCTCATCGTTTGCACCAAAAAATGCTGTTTTCCATCCACCAGTACTGAAAAATATGGTTAACTTACCAACATTATCTGTTTCTAAGCAAATAGAAAAAAAAGCAAACACAAATATGGCTGAGAAAAAAAACAAAGAGTCCAAGAGTAAAATCCTAAAAGACAATTCAAAACAGAATGAATCCAGGGTACTTCGTAATAGGCCTCAATCAGACAAAGGCCAGGGTAAAACACAGGTTACTAAAAATAAGAAAGCTGACGCTTGCAAAGAATCCACCTCATCTTCCTCAAGCAGTGTAGAATCTAATTATGATGAGCCTAATGCTTCAAAATCTCTGAAATCTAGGACTCCTAGAAAATCCTTACAAAATAAACAAAATATGGTCACCCCCAAAAATGTGCCAAAGAGTGAATCAAGCTCTGAAGAGAAATTACGATCTCCAAAACTAATAGAAATCCCGATGACTCCAGAACAGATTGCCGAAGAAGCTAAGAAAATAAGTCCCTGTGTGACACTCTCACGGGGCAAGGATAACGCCAGGAGAGAGATGAAGAAGAAGTTAGAGGAAGGTCTTTTGGACGAGGATCTATGTGAGATGGACAGTGTCGAACATTTCCGCCGGCAATTGGACTCCGAAATTGCTCGTATGACGGAAATGTGTGAGGCTTGGGAGAAGATCTCACACCAGATAGCACTACCAGAAACCGTGCAGGAAGCTGTGTTGTCCGCCGTGGGTCAGGCTCGACTGCTGATGTCTCAGAAGTTGCAGCAGTTTGCATCTCTATTGTCCCGCTGTGAACACCCCACGCCCAACAGCGGGCTCGTCACACCCAGTGACCTTCACGGCTTCTGGGATATGGTGTTCATGCAGATAGAGAACGTTGATATGCGTTTCCGTAAGTTGGAAGAGTTGCGTTCTCGTGGCTGGAGCGAAGACCAGCCCCCGCCGCGGGCGACACGTCCAGTTCCGCGGCCCTCCCGCCCTCAACCAGCCGCCCGCCCCGCTCCAGCCGCTCGTCCCGGAAACAGCCGCCTCAAGGATCTCATAGCAGCGGCCCGTAAAGCCAAACAAGCCAAGTGCTCCGAGGAGGTTTCCATGGAGGAAAGTAAAACTATAGACATGGGCTTCTTCTGTATTCAATCTCCAGTGAAGTCACCCCTTCAGGTGACTCCGAGCAAACCCAGCTTGTTGAAGGCTGTGCTGTCTAATGAAGCAGAGAAGTCTGCCAATAGGAAGTCTGCATCGTTTGCTATGCTCCGAGCGTCGCTCATAGGACGTCAGGTGGAAAGCGAAGGGTCCAGTGATGAGGAGCATAATCTGATAACGTTCACTCCGGTTGACCTCGGAGCTACTCCAGGTCGCAGTATACTCAAAAACAAACCGTCCACCAAGAAATCAGCCAAAAAATCTATCAAAGTCGTGCTCTTCAACGAGTCGGACACGGAGCTGCAGAACAATTCAATGAGCTCGGACAAAGCTCTCGAAGCAGAAGATGTAGAGACACAGGAAGGCCAAAAACTTCAGATGGAACACAACACGGACAGCGGCATCTCGTCGATGGATATAGAAAACGACACCGAGAAGGAAAACAAGGGTAGGAGGAGGTCCAGACTCACCAGGCAGGACGCCACGGAGGAGAGGAGCCCCGTCATGACCCGGAGCAGGAGGAAGAGCATACTCACACCTGGCAAGGAAGTGGCCGCCAGGAAGAACAACACTCTCAAAGAAGCAAACCTTGAACATAATACGACAACGAGGAGGTCCACACGGAAGAGTATTCATGACGATCATTGA

Protein sequence:

>DPOGS210367-PA
MHAVKKRLETSIRNRRTTRFSVFDKIRNLPRCESPAPIPEVQKKVEHRRMQLEKWKEEKEKKKKEASLQKKKPFIAGVPHNPLKFVPPPPPKPMPSTSGRVTRSQSSKNNSVNNVKKPNKTEGKSKIIVSSFAPKNAVFHPPVLKNMVNLPTLSVSKQIEKKANTNMAEKKNKESKSKILKDNSKQNESRVLRNRPQSDKGQGKTQVTKNKKADACKESTSSSSSSVESNYDEPNASKSLKSRTPRKSLQNKQNMVTPKNVPKSESSSEEKLRSPKLIEIPMTPEQIAEEAKKISPCVTLSRGKDNARREMKKKLEEGLLDEDLCEMDSVEHFRRQLDSEIARMTEMCEAWEKISHQIALPETVQEAVLSAVGQARLLMSQKLQQFASLLSRCEHPTPNSGLVTPSDLHGFWDMVFMQIENVDMRFRKLEELRSRGWSEDQPPPRATRPVPRPSRPQPAARPAPAARPGNSRLKDLIAAARKAKQAKCSEEVSMEESKTIDMGFFCIQSPVKSPLQVTPSKPSLLKAVLSNEAEKSANRKSASFAMLRASLIGRQVESEGSSDEEHNLITFTPVDLGATPGRSILKNKPSTKKSAKKSIKVVLFNESDTELQNNSMSSDKALEAEDVETQEGQKLQMEHNTDSGISSMDIENDTEKENKGRRRSRLTRQDATEERSPVMTRSRRKSILTPGKEVAARKNNTLKEANLEHNTTTRRSTRKSIHDDH-