Monarch geneset OGS2.0

DPOGS207434
TranscriptDPOGS207434-TA1824 bp
ProteinDPOGS207434-PA607 aa
Genomic positionDPSCF300087 + 547892-603526
RNAseq coverage256x (Rank: top 41%)
Annotation
HeliconiusHMEL0156218e-13180.00% 
BombyxBGIBMGA009333-TA0.082.63% 
Drosophila% 
EBI UniRef50UniRef50_UPI00020611DD1e-14055.27%UPI00020611DD related cluster n=2 Tax=unknown RepID=UPI00020611DD
NCBI RefSeqXP_394328.35e-15950.00%PREDICTED: similar to mammalian FE65 Homolog family member (feh-1) [Apis mellifera]
NCBI nr blastpgi|3407105692e-15949.17%PREDICTED: amyloid beta A4 precursor protein-binding family B member 2-like isoform 1 [Bombus terrestris]
NCBI nr blastxgi|3407105697e-15350.63%PREDICTED: amyloid beta A4 precursor protein-binding family B member 2-like isoform 1 [Bombus terrestris]
Group
Gene OntologyGO:00055151.8e-22protein binding
KEGG pathway 
InterPro domain[232-405] IPR0060201.8e-22Phosphotyrosine interaction domain
[424-552] IPR0119932.7e-22Pleckstrin homology-type
Orthology groupMCL15685 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207434-TA
ATGGGACTTCACAAGAGAACAAAGAATGGTCGGCTCTCCTACGAGAATCCAAACTACCATTTGGACCCGGCGCGACTCGAGTCTGCCATTTACAGTGAACTTTACGACATGCATCAGGACAAGAGCATGCCTCTAGAGTACGACAACTACCTCGACAACAGAAGAGTTGAAGACGAAGCGTCTTCGCCAGTTAATAAAGTAAACGATAAAAGCGGCTTCATCACGCCGCCCCAACAGAATGGCGGCGGGGAGTCGCCCCCTCCCGAGAAAGAGAGGGCTGACCAAGAGGACACGGAAAAGAGACACTCTGGACCCGTGCCTCACTGTGCCGAGGGACCCAACGATGATCTTTACGCCATCCCTGTTAAACTGAGACCCAAGAAAGAAACTTTGCCTCCCGGATGGGAAAAACATGAAGATAACGACCCGGGTATGATGACCGTGTATTTGAGCGGCACAATTCAGAGGGAAATCCCGAAGATGCCGCCGATAGAGGCTAGAGAGTCTCGCATCTCCATGGTGAGGGACTGCTCCAACTTGTCGGATGTCAAATACGAGGGGGCAATGACCTCGTCCGTCACCAGGAGTACCACCAGCGGAGCTTTGGATCACGTGGACCAAGACGCCGAGAGAAAACGGAGAGAGGAAGTCGCTTACAAACGTCGCAGCTACCCCGCTCGCCCTGACTCCGAGGGTAGAGCGGTTCGCTTCTTCGTGCGTTCTCTGGGCTGGGTGGAGATCTCTGAGGCTGATCTGACCCCTGAGAGGTCAAGCCGCGCTGTCAATAAGTGCATCGTCGACCTCAGTCTTGGGAGAAATGACCTCCTCGATCAGGTCGGCCGCTGGGGCGATGGTAAAGATCTGTTCATGGACCTGGACGACGGCGCCCTGAAGCTGATAGATCCAGAGAGCTTGAACACTTTGCACACGCAACCAATACACACTATCCGTGTTTGGGGAGTGGGCAGAGACAACGGCCGACCAGTGGCACATGCACCAGTTAATGAAACAGAGTTTAGTATACCGTACGAGATGACAAGTGACTGGGATAAGGGACCCAGGGGACAGGACTTCGCGTATGTCGCCCGGGATCGCAGCACGCGCCAGCACATGTGTCACGTGTTCCGCTGCGAGGCCCCAGCCCGGTCGATCGCCAACGCGCTGAGAGATATCTGCAAGCGTATCATGATAGAGAGGTCGCTGCAGCCGCCGCCGAGACCCACAGACCTGCCGGCCGCCAGGCGACCGCGTCCGCTGTCAGGCGCCTCGTTCCCGACCCCAATGGAGGAGCCGCGCAAAACGGTTCGGGCGCGTTACTTGGGCAGCGCGGAGGTGCCCCGCGCCACCGGCATGTCGGTGCTCAACGACGCCATCGACCGCCTGGCCGCTGCCAGGGCCCCGTCCGCCTGGAGACCCGTCGCCGTCGCTGTGGCGCCCTCTATGATCACCATCACTGAGGAAGGAGAGGTGTCTCCGCTGGCGGAGTGTCGCGTGCGCTACTTGTCGTTCCTGGGCATCGGGCGGGACGTGCGTCGCTGTGCGTTCATCGTCCACTCGCCGCGGGACGTGTTCGTGGCGCACGCCTTCCACGCCGAGCCCTCGGCCGGCGCGCTCTGTAAGACTATAGAGGCAGCTTGCAAGCTCCGCTACCAGAAGTGTTTAGATGCGCACGGCGGCGCCCTGGGGCTCCCGGGGCTGGGCGGCGGTGCTATCCAGGCCCCAGTAATGTCTAGTTACTCAGCACTCTCCCTGAACATCCCGGGAAACCCACATTGTGAAGCTCTGAAGGTGTTTCGTACAAAATTTATGACGTCACGTTCCTAG

Protein sequence:

>DPOGS207434-PA
MGLHKRTKNGRLSYENPNYHLDPARLESAIYSELYDMHQDKSMPLEYDNYLDNRRVEDEASSPVNKVNDKSGFITPPQQNGGGESPPPEKERADQEDTEKRHSGPVPHCAEGPNDDLYAIPVKLRPKKETLPPGWEKHEDNDPGMMTVYLSGTIQREIPKMPPIEARESRISMVRDCSNLSDVKYEGAMTSSVTRSTTSGALDHVDQDAERKRREEVAYKRRSYPARPDSEGRAVRFFVRSLGWVEISEADLTPERSSRAVNKCIVDLSLGRNDLLDQVGRWGDGKDLFMDLDDGALKLIDPESLNTLHTQPIHTIRVWGVGRDNGRPVAHAPVNETEFSIPYEMTSDWDKGPRGQDFAYVARDRSTRQHMCHVFRCEAPARSIANALRDICKRIMIERSLQPPPRPTDLPAARRPRPLSGASFPTPMEEPRKTVRARYLGSAEVPRATGMSVLNDAIDRLAAARAPSAWRPVAVAVAPSMITITEEGEVSPLAECRVRYLSFLGIGRDVRRCAFIVHSPRDVFVAHAFHAEPSAGALCKTIEAACKLRYQKCLDAHGGALGLPGLGGGAIQAPVMSSYSALSLNIPGNPHCEALKVFRTKFMTSRS-