Monarch geneset OGS2.0

DPOGS207371
TranscriptDPOGS207371-TA1668 bp
ProteinDPOGS207371-PA555 aa
Genomic positionDPSCF300267 - 119594-122592
RNAseq coverage359x (Rank: top 33%)
Annotation
HeliconiusHMEL0122460.088.75% 
BombyxBGIBMGA009004-TA0.076.21% 
Drosophilaqtc-PF1e-11646.90% 
EBI UniRef50UniRef50_Q16G881e-11651.49%Putative uncharacterized protein (Fragment) n=1 Tax=Aedes aegypti RepID=Q16G88_AEDAE
NCBI RefSeqXP_314710.34e-13153.93%AGAP008614-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1187855128e-13053.93%AGAP008614-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1187855123e-12753.93%AGAP008614-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00055153.9e-07protein binding
KEGG pathway 
InterPro domain[508-549] IPR0002373.9e-07GRIP
Orthology groupMCL12645 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207371-TA
ATGAATAGAAACTGTGGTAGAAAGACAGAGTTTCAAGACGAGCACTACTGCGCCGGAAATGTCACTGGGAATTTCCGACGCGCTTGTTCACTGCGGCTGCGCGGTGAAAAAATGGTGCAACGATCCCCTTTAACAACTAGAAAATATATTCCAATAATCACGGAAAATACAAACCAGAAACAAAGAAATGGTTCCCGTCTTCCGGAGCCGAGCTACCGCGCAAGGAGTCAATCATTTAACTCCACACAAAAACCGAAAAAATCCTGTTTAAAATTCCAAGAAACCTCCATAGATAGAAACCTCAGTATGACAGACTCGGCCCACACACCACCAGGATCTCCAGAAGATTTACCCGACGATGAATCTTTACATAGTTACGGAAGCGCAGCCACGGCCGCCTCTATTGATGCTGGATATGCACCATTCAATGGCACTACTTTCAGTGGTAGATCGATGCGTTATGTCTTACACTGTTCCTCACATGCCGGCTTAGCTGGAGAAGATTATCTCACACCAACCCAGAGAGCCCAAAAGCAAATACGTCGATTGAAATCTCTGTTAGCTCAGGCCAAGAAGGATTTAGAAGAAAAGGATAGTGAAATTTTTCAACTTACAAAAGAAGTTGTTGAGCTCCGTCTCTACAAAGCTTCAATATTTTCTCCAGATGAAAAGTCAAATTCAAGTGAAATTGTGACCGTCCGAGAAAATAATGATGAAGCTTCCATAGAAGAAGAGAGCATCAAATGTAAAAGTGCCCTTCGGTCATTTGACATCACTGATAGTCCTTTGTACAAAGATCAAACACCAACAAGATGCCGCAATGAGATGCAGGGCTCGTTTACTGATTCCGGTCACTTTGAGGACCTAACTAATTCTTCTTTACATTCAAAAGAATCTGTGCACATGTTGACCCATGATGCTGCATGCATGACGGAGACAATTGACAGTGATGAAGAGCGCCGGAATTTAATAGCTTTCTATGAAAAGAAAGTAGAAGACATAATGAGAGCTCATGTTGGTGAGACACAAGAAATTAAAAAATCACACAATGACAAAGTAGAGGCGTTATTACAAAAGTTATCGGATGTCAACACCAGGTATTGTGAACTTCTACCAAACTATGAGCAGGCTAAAGAGAGGATACATATACTGGAAAAACAGTTGGAGAATGCCAGCAAACAGTTACAAGACGAAGAGAGCAGACACCGTACAGTATACTTACAAATGTACAACAAGGGAAAGGAAGCTGCAAAGTTTGAACTAGATAAAGAAATAGATCCCGAACCTAGTACAAGCCAGCTCAGTAGAGTCTCCGTCGAGGAACTGTTGGAACAATTGCAGATAACGCAGACCGAGCTCGAAAATGTCAGAGATTCGGCATTCACAGCGGACAGAACGGCAAAGTCACAAGTACTTCTTAGTGCAAAGGAGGCTGTTTCTTTATGGGTCCTAGGAGCTCGAAAGGCAATGTATCGACGTATTGTGGAGTCCCAGAAAGGAAACAAGACCATCATTGATCCGGAAGTGACCTTGCAGTTCCTGAAATCGGCGATCTACTACTTCCTGACGGATCCCGAAAACCATCAAGGTCACCTGAATGCCATCGAAAACATCCTAGGGTTCACCGAAGCTGAAAAGAAAAATATACGCAAAGCGAGAACGACGTAG

Protein sequence:

>DPOGS207371-PA
MNRNCGRKTEFQDEHYCAGNVTGNFRRACSLRLRGEKMVQRSPLTTRKYIPIITENTNQKQRNGSRLPEPSYRARSQSFNSTQKPKKSCLKFQETSIDRNLSMTDSAHTPPGSPEDLPDDESLHSYGSAATAASIDAGYAPFNGTTFSGRSMRYVLHCSSHAGLAGEDYLTPTQRAQKQIRRLKSLLAQAKKDLEEKDSEIFQLTKEVVELRLYKASIFSPDEKSNSSEIVTVRENNDEASIEEESIKCKSALRSFDITDSPLYKDQTPTRCRNEMQGSFTDSGHFEDLTNSSLHSKESVHMLTHDAACMTETIDSDEERRNLIAFYEKKVEDIMRAHVGETQEIKKSHNDKVEALLQKLSDVNTRYCELLPNYEQAKERIHILEKQLENASKQLQDEESRHRTVYLQMYNKGKEAAKFELDKEIDPEPSTSQLSRVSVEELLEQLQITQTELENVRDSAFTADRTAKSQVLLSAKEAVSLWVLGARKAMYRRIVESQKGNKTIIDPEVTLQFLKSAIYYFLTDPENHQGHLNAIENILGFTEAEKKNIRKARTT-