Monarch geneset OGS2.0

DPOGS204384
TranscriptDPOGS204384-TA1821 bp
ProteinDPOGS204384-PA606 aa
Genomic positionDPSCF300002 - 1683987-1694639
RNAseq coverage800x (Rank: top 16%)
Annotation
HeliconiusHMEL0130790.084.03% 
Bombyx% 
DrosophilaCip4-PB1e-15248.70% 
EBI UniRef50UniRef50_E2A0X36e-16451.73%Formin-binding protein 1-like n=9 Tax=Endopterygota RepID=E2A0X3_CAMFO
NCBI RefSeqXP_397251.31e-16351.13%PREDICTED: similar to Cip4 CG15015-PA [Apis mellifera]
NCBI nr blastpgi|3071881942e-16351.73%Formin-binding protein 1-like [Camponotus floridanus]
NCBI nr blastxgi|3071881943e-16452.72%Formin-binding protein 1-like [Camponotus floridanus]
Group
Gene OntologyGO:00055151.8e-09protein binding
KEGG pathwaymdo:1000262341e-89 
 K07196 (TRIP10, CIP4)maps-> Insulin signaling pathway
InterPro domain[17-103] IPR0010606.8e-18Fps/Fes/Fer/CIP4 homology
[521-598] IPR0014521.8e-09Src homology-3 domain
Orthology groupMCL12010 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204384-TA
ATGTTCGTCAGTCTTAATGAGAAGCGACAGTACGCTATAGATGCTGGCTGGACCTCTGATCAATATGATAACTTAGCCACCCATACTCATAAAGGTATCGAATTCTTAGATAAATATGGGAATTTTGTGAAGGAACGGTGTGCTATAGAGTTAGAATATGCTGGCAAATTAAGGAGGCTTGTGAAGAGCTATCAACCAAAAAGAAAGGAAGAAGACGAATACGTATATACATCATGTAAAGCATTCAGACAGCTCCTTCAGGAGTTGGGTGATTTTGCTGGTCAGAGGGAGTTGGTGGCTGAGAATTTACAGTCGAACGTTGTACGAGAGTTACATTTACTAGCTAAGGAGCTGAGAGAGGCTAGGAAGGGACACTTAAACGACGGTTCAAAGCAAATGGCCGTCCTTAGTACGGCCGTGGGAGCGTTGGAGCGTGCGCGTCGTTCATACGAGAGGGCGGCGCGGGAGTCGGAGCGTGCACTGGAAGCCTTCCAGCGAGCGGACGCAGACCTGCACCTCAGTCGGGCCGAGGTCGAGAAACAGAAGATGAACATGAAATTAAGAAGTCAAGCGTGCGAGGACGCGAAACAGGAATACATGGACCAACTGAGGAAGACCAATGAGGCACAGAGGCAGCACTACGAACAACAGCTGCCGCACGTTTTCAAACAGTTACAGGATTTGGACGAGAAGAGAATAAAACATATAAAGAATTTCATGATCAGCTCAGTGGATGTCGAGAGGAAGGTGTTCCCGATTATAATGCAATGTCTCGACGGTATGGAACAGGCCGCCAAGAATATAAACGAGAAAGAGGACACTCAGTTAGTAATAGAGAGGTACAAATCTGGTTTCGTCCCGCCTGAAGACTTCAGATTTGAACCCGCGACTGGCGCTGACGCCACGGACTCTGTACCAGCCCCTACCCACAACCATATCACAGTTAGAGGCACGGTGTCCGGTAACCGGATCAAGAAGAGAGGTGGCCTGCTCTCTATATTCAGCTCAAACAAGAATAACTTGTCGGTCGATGGAAAGGAGGATTATTCAGATTTACCGCCCAACCAGAAGAAAAAGAAACTGCTAGCTAAAGTACACGAATTGACCAAGCAGGTTGGCCAAGAACAAGCTGCTATGGAGGGTCTTATGAAAATGAAAGGGGTCTACGAAACAAATCCCACGCTGGGCGACCCTATGACTGTGGAAGGTCAGCTTAACGAATGTTGTGATAAACTAAAGAAGCTCCGTCTCCAACTACGCAAATACGAAGAGCTACTGGCGGAAGCGAACAACCAGGTGTGCGCCCAGCCCATACACTCCATCAACAAGACTAACGGCGCCCCCACACAGGCCACGAGCATCGGTTCGAACAGCGGTTCCCTATCCCGCTCAGCGTCCGAATCCTCCGTGAGTACGGGTACCGGCACTAACACTGGCAACACCGTCATGGCGGTGTCGTCCCGGGCCGCGGGGGGTTCCCCGGAGTCGGGTCTCGGCGGCGAGCTGGCCGCGGGTCACGCGGAACACGCCAACGGTCATGACCACGATGACCACGACCACGACCACGACGACCACGAGTCCGACTTCGATTATTATTACGAGCCGGACTTACAGCCACTAGGTTACTGCAAGGCGCTTTATGCTTTCGAAGCGAACGGCACCGGCTCAACGATGCGTATGGAGTGCGGCGAGAAGCTGCTGGTGTTGGAGACGGACGCTGGTGACGGCTGGACGAGGGTGAGGCGGTCGCTCACCAGGGAGGAGGGCTTCGTACCCACCACGTACATCGCCACCACGCTGTACGCCGACGTGCATCACTAG

Protein sequence:

>DPOGS204384-PA
MFVSLNEKRQYAIDAGWTSDQYDNLATHTHKGIEFLDKYGNFVKERCAIELEYAGKLRRLVKSYQPKRKEEDEYVYTSCKAFRQLLQELGDFAGQRELVAENLQSNVVRELHLLAKELREARKGHLNDGSKQMAVLSTAVGALERARRSYERAARESERALEAFQRADADLHLSRAEVEKQKMNMKLRSQACEDAKQEYMDQLRKTNEAQRQHYEQQLPHVFKQLQDLDEKRIKHIKNFMISSVDVERKVFPIIMQCLDGMEQAAKNINEKEDTQLVIERYKSGFVPPEDFRFEPATGADATDSVPAPTHNHITVRGTVSGNRIKKRGGLLSIFSSNKNNLSVDGKEDYSDLPPNQKKKKLLAKVHELTKQVGQEQAAMEGLMKMKGVYETNPTLGDPMTVEGQLNECCDKLKKLRLQLRKYEELLAEANNQVCAQPIHSINKTNGAPTQATSIGSNSGSLSRSASESSVSTGTGTNTGNTVMAVSSRAAGGSPESGLGGELAAGHAEHANGHDHDDHDHDHDDHESDFDYYYEPDLQPLGYCKALYAFEANGTGSTMRMECGEKLLVLETDAGDGWTRVRRSLTREEGFVPTTYIATTLYADVHH-