Monarch geneset OGS2.0

DPOGS211481
TranscriptDPOGS211481-TA1134 bp
ProteinDPOGS211481-PA377 aa
Genomic positionDPSCF300113 - 75977-87884
RNAseq coverage76x (Rank: top 65%)
Annotation
HeliconiusHMEL0159112e-5571.20% 
BombyxBGIBMGA007992-TA2e-7459.93% 
DrosophilaCG32406-PA3e-4139.93% 
EBI UniRef50UniRef50_E2A6562e-5638.79%Tensin-1 n=7 Tax=Formicidae RepID=E2A656_CAMFO
NCBI RefSeqXP_002432244.12e-5640.86%conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastpgi|3071848018e-5638.79%Tensin-1 [Camponotus floridanus]
NCBI nr blastxgi|3071848013e-5940.32%Tensin-1 [Camponotus floridanus]
Group
Gene OntologyGO:00055151.2e-19protein binding
KEGG pathwaydpe:Dper_GL122921e-06 
 K00665 (FASN)maps-> Insulin signaling pathway
    Fatty acid biosynthesis
InterPro domain[256-362] IPR0009801.2e-19SH2 motif
Orthology groupMCL15877 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211481-TA
ATGCGGTACGAAGCAGGCTCCGCGGCTGCAATTTCGATGAACAAAGTGAAATACGAATCCATTAAGTCCGGCGACACACTGGAAGCGGCCGAGGAGCTGAACGCGTTAGTGGGAGATGCGTTTCGGATGGCTTTCGCGTCGCAACTGCAACCTTCCGCTCCACTCTGGAGCAAAGAGCTAAGTGGAGGCTCGTGCCCTCGTGCTCCCGACACTCTCCCGGGACTGACGCACCATTACCCTCACCGACTGGACCGGTCTCCATCACAATGCGATATAATGGGCGGGTCCAAGCTGGACGGCTGCGATGCGAGCGAACCCAGCGAGGCCAGCGAGGCCAGTCCCAGCTCCTCGGAGGATTGGAACTCACCCACAGAACTCAGCTGTTACAGGAGACTCGGAGAGAGACTCGAGAGAATGGAGCGACCTCCCCTCATGAAAAGACTCGCGCTAGGACTCAGCGGAGCCTTGCTGCAGCCCTCGGACGACGACGCGACGCCACTCGTGACAGACACGCCCACTACCCCCACCAACGTCCCGCTGTGTCTTAACGGAGGATACATCAACGAGGCGTCCGAAGCCGAGGCTCGTCGAGACGCTCGCCGGGACCAGAGGCCGGACCCAGACTTCAGAACCTGCAATAATATTACAAACGGCCGCCCCGGCCCCGGCAGCGGAAGCGGTGCTGGCTCCTCGTCCTCCGGGGAGTCTTCCTGTTCGCGAGCCTCCGGTAACAACAACAATTGTAACTTGCGACCTCTGGAGCCTGAACTGAGACACGCTCCATGGTTCCAGCAGGGCATCCCTAGAGAGATAGCCCTGGAGGTGCTCGGGGCGCAGACCCCGGGCTCGTTTCTTGTCCGCGCCTCCACCACTCAGGCGGGCTGCCTTGCGTTGTCTCTGCGGGTGCCCCGGGACTTCGCTCCTCACGGCATCGCCCATTACCTAATACTGAGAACTAATAAGGGATACAAAATCAAGGGCTTCACGAAGGAGTTCAGTTCCCTATCAGCCCTCGTGACTCACCACAGCGTGATGCCGGAGCTGTTGCCGGTCGCTCTCAGGCTGCCGAGACGAGCGCCGCGCTACAACGACGAGAGGGAAAACATCGACGAGCTCCGCACACACGACCTCTGA

Protein sequence:

>DPOGS211481-PA
MRYEAGSAAAISMNKVKYESIKSGDTLEAAEELNALVGDAFRMAFASQLQPSAPLWSKELSGGSCPRAPDTLPGLTHHYPHRLDRSPSQCDIMGGSKLDGCDASEPSEASEASPSSSEDWNSPTELSCYRRLGERLERMERPPLMKRLALGLSGALLQPSDDDATPLVTDTPTTPTNVPLCLNGGYINEASEAEARRDARRDQRPDPDFRTCNNITNGRPGPGSGSGAGSSSSGESSCSRASGNNNNCNLRPLEPELRHAPWFQQGIPREIALEVLGAQTPGSFLVRASTTQAGCLALSLRVPRDFAPHGIAHYLILRTNKGYKIKGFTKEFSSLSALVTHHSVMPELLPVALRLPRRAPRYNDERENIDELRTHDL-