Monarch geneset OGS2.0

DPOGS210793
TranscriptDPOGS210793-TA2031 bp
ProteinDPOGS210793-PA676 aa
Genomic positionDPSCF300027 - 1171100-1176600
RNAseq coverage572x (Rank: top 22%)
Annotation
HeliconiusHMEL0077390.091.95% 
BombyxBGIBMGA007108-TA0.092.24% 
DrosophilaCG12424-PC2e-8140.93% 
EBI UniRef50UniRef50_D6WDR07e-16255.54%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WDR0_TRICA
NCBI RefSeqXP_970959.13e-16055.41%PREDICTED: similar to GA11625-PA [Tribolium castaneum]
NCBI nr blastpgi|2700034873e-16155.54%hypothetical protein TcasGA2_TC002730 [Tribolium castaneum]
NCBI nr blastxgi|2700034870.054.97%hypothetical protein TcasGA2_TC002730 [Tribolium castaneum]
Group
Gene OntologyGO:00055159.3e-18protein binding
KEGG pathwaytgu:1002183462e-06 
 K05111 (EPHB2, ERK, DRT)maps-> Axon guidance
InterPro domain[253-320] IPR0137612.5e-19Sterile alpha motif-type
[246-323] IPR0109939.3e-18Sterile alpha motif homology
[259-317] IPR0211292e-15Sterile alpha motif, type 1
[252-319] IPR0016609.3e-15Sterile alpha motif domain
Orthology groupMCL15846 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210793-TA
ATGTCCCGCGGAGGTAGCGGCGCTGGTAAGACAGCGACTGCAAAGCGAGTGCCTCCGCCAGCGGTGCCAGGGGATGCATTCGGGACTCCGCAACACAGACACTCGGGCTCGAGCTTCGGCTCTCAGGGTTATGCCTCGTGCGAAGAGCAGGCGTACCCTCAGCCCCCTGACACGCCCTCACATACAAGAGATGATCACTCGGACTATGGCAGCACTGTGTCGGGTGTGTCCGGCGCTAGCGGGAGTCTCGGCAAGAGTCCTGCTGGGGGAGGAACCTTCACCTTCCCTCCACCCACCCAGCCCCTCACACACAAGGCAGCTGTATACTATCATCACCAACTAGCGCTGAGTGATGACCAGGGCATAGATATGACACAGAGTCCCGGGCGCGACAGTCCCGGTTCATCCTCGGGCTCCGCTGGTTCAGGGTCCCGGCATTCGTCCGCGTCTCTGGACAGCGGCCGAGCGTCCGGTCGGATGCCTCACCATCATCACGCCACCTGCCACTGTGGAGATACCGTCGATAGAGTGCGGGCCATGATCGGTCAGGGACTACCGGACCATGATATTATTCATGCATGGCTGGCGGATTTACAAATGGAAGAATACGCTAGACTGTTCGTCGAAGCGGGCTACGATCTGGCGACCGTTACTAGAATGACGCCCGAGGATCTCACAGCGGTCGGCATCAAGAAACCAAACCATCGCAAGAGGTTAAAGGCTGAACTCACCAACCTCAACGTGCCCGACAATCTGCCTGATTACATCCCGGGTTCTCTGGAGGAATGGCTTCGTCTGCTGCGGTTAGAGGAATATGGGCCGGCTTTAGTGGCGCAGGGATATCGCACGGTGCATGACGTCACACAACTCGCCTGGGAGGATCTAGAAGATATGGGCATCGTTCGTTTGGGACATCAGAAGAAAATTTTGCTAGCCATTAAAAGAGTGAAAGACATCAGGGCGGGGAAGCGAAGCATCAGCAGCCAAGGATCGCTCGATTTCACGAGACTATCTGGACAGGATTTATATACCCGAGAGCATTACCAGCCGTGGGAGAGGAGGAGTTACCATAAACCCCCCGATCTAAAGTTTGACGCTCTACCGACCCTAGCCACTGATTTGGTCCCCATCCAGATACGTCATCCGCGCGGGAAATCCCTCGAAAGTCTAGAAGATCCGACGGAAAGGACTTCTCATACAACGTTTTCACCTGAAGCAGGGTTCTATTATGGAGGGCAATGGAGACGTTCCTATGACGATGGGGATATAACACCCACAAACGACAGCTCGTACGAAGGCGGCGGAACCCTGCCGCGTCCGAGGGGCTTAGTAAGACCGCGGCCTGTCGCCAAAATACAAGCCACTCCAGCATACAGAGACAAATCACCAGATTACACCTACGACGAGATAGCGTACTCAGCTAGATTACAACGCGTCGCGTACGGCGCGAGTCCTCACGTGGCGAGGAAGCCCCCTCCAGAACCGCCCAAACGTCAAAGTTCCCAATACGCACCGTTCACGAGATTTGGTCAAACCACGGTCGAAATCCATCCAGAGAAGAGTCTTCCATTGAGTCTGCCCGCGTATCCAAGTTCAGATTCCTTGAGCGTCTCTTTGGACAGCACGGGTCTATTGCCGCCGCCGCCCGCACCTTCCTCTCCGCCCAGGAGATACGAAGACGATAAAATGAGAACCGGATCCGACGCTAGCTTTAAGTCAAGTTCCAGCACCGAATCAGATAGCATTCCATTCGCAAACGAGAACGCGGGGACGATAAAACAGAACAGAGGCCAAATAACTGGTAGACCGCACACCGTCGACTACGGACGAGCGGGCATCAACCTCACCGGCCTACCGCCGCGGAACCCCGACATGAAACCCAACCATCCCCTCCCGGAACATAAAGACGAGAACAAAAGTACGGAACCAGTGGACGTGCTCAATGACATAGGCAACATGCTGGCCAACCTAACAGACGAATTGGACGCCATGTTGGAAGAAGAAAAACGCCAGGGTTTGACTGATTCTTAA

Protein sequence:

>DPOGS210793-PA
MSRGGSGAGKTATAKRVPPPAVPGDAFGTPQHRHSGSSFGSQGYASCEEQAYPQPPDTPSHTRDDHSDYGSTVSGVSGASGSLGKSPAGGGTFTFPPPTQPLTHKAAVYYHHQLALSDDQGIDMTQSPGRDSPGSSSGSAGSGSRHSSASLDSGRASGRMPHHHHATCHCGDTVDRVRAMIGQGLPDHDIIHAWLADLQMEEYARLFVEAGYDLATVTRMTPEDLTAVGIKKPNHRKRLKAELTNLNVPDNLPDYIPGSLEEWLRLLRLEEYGPALVAQGYRTVHDVTQLAWEDLEDMGIVRLGHQKKILLAIKRVKDIRAGKRSISSQGSLDFTRLSGQDLYTREHYQPWERRSYHKPPDLKFDALPTLATDLVPIQIRHPRGKSLESLEDPTERTSHTTFSPEAGFYYGGQWRRSYDDGDITPTNDSSYEGGGTLPRPRGLVRPRPVAKIQATPAYRDKSPDYTYDEIAYSARLQRVAYGASPHVARKPPPEPPKRQSSQYAPFTRFGQTTVEIHPEKSLPLSLPAYPSSDSLSVSLDSTGLLPPPPAPSSPPRRYEDDKMRTGSDASFKSSSSTESDSIPFANENAGTIKQNRGQITGRPHTVDYGRAGINLTGLPPRNPDMKPNHPLPEHKDENKSTEPVDVLNDIGNMLANLTDELDAMLEEEKRQGLTDS-