Monarch geneset OGS2.0

DPOGS201238
Transcript: DPOGS201238-TA (1089 bp)
Protein: DPOGS201238-PA (362 aa)
Genomic position: DPSCF300037 - 139225-143082
RNAseq coverage: 537x (Rank: top 23%)
Annotation
Heliconius: HMEL002771; 3e-127; 80.07%
Bombyx: BGIBMGA012500-TA; 2e-167; 75.41%
Drosophila: Abi-PD; 2e-66; 63.68%
EBI UniRef50: UniRef50_B0XH69; 5e-109; 52.60%; Abl interactor 2 n=5 Tax=Arthropoda RepID=B0XH69_CULQU
NCBI RefSeq: XP_973140.2; 3e-120; 55.96%; PREDICTED: similar to abl interactor 2 [Tribolium castaneum]
NCBI nr blastp: gi|322788161; 2e-123; 55.53%; hypothetical protein SINV_04899 [Solenopsis invicta]
NCBI nr blastx: gi|270001565; 4e-135; 58.33%; hypothetical protein TcasGA2_TC000412 [Tribolium castaneum]
Group
Gene Ontology: GO:0005737; 1.2e-29; cytoplasm
 GO:0005515; 6.4e-23; protein binding
KEGG pathway: gga:424108; 2e-55
 K05751 (ABI2) maps-> Regulation of actin cytoskeleton
InterPro domain: [93-159] IPR012849; 1.2e-29; Abl-interactor, homeo-domain homologous domain
 [293-361] IPR001452; 6.4e-23; Src homology-3 domain
Orthology group: MCL14124 (single-copy universal gene)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201238-TA
ATGGCGGAGTTAGCAGCCTTACTTCAAACAGACATACCTGAGGGGCGCAACCATCTCACAGACAGTCATACAAACCTCGAGAGAGTTGCAGAATACTGTGAAGCTAACTACTTTCAGTCCGAAAACAAGAGATTAGCTTTAGACAGTACCAAGAACTACACAACACAGTCGCTAGCAAGTGTTGCATATCAAATAAATACTCTCGCATATAATTTCTTGCAACTATTAGAGCTACAAACTATGCAGTTAGCTGAAATGGAAGGTCAGATGAACCACATTGCACAAACCGTAGCTATCCATAAAGAAAAAGTTGCAAGACGGGAGATAGGTGTTCTGACTGCTAATAAAGTCACCAATAGACAGTATAAAATTATTGCACCAGCCAATCCAGAGAAACCTATAAAGTATGTACGGAAGAGTATTGATTATACAGCCTTGGATGATATCGGTCATGGTGCTCGTTGGAGTGGCTCGGGTGGGTCTGGTACACCTCGCGGTCGTCGCTCCGGGTCCGCTCCCAACCCTCTGCCTGCGCCAACAACCAAACCACCCACACCACCAGCTGTACGATCAACCAACGCAGCAAACACTGGAACACTAGGAAGGGGAACTCTGGGCAAGTCTTCCCGCGAGTACCGCACGCCTCCAGCCGTCGCTCCCCCTCAGGTAGTGAATTGTCACCAGGTGGGTATGGTGCACCCCAACCAGCATCAATCTCCATCCAACTATTCACAACAGGACCTACACTCCAGTATGCCACCTCCACCGAGCCCTTTGATTGGTTCAGACGGTGAAAACAGCCAACATTCCATGGTGCTGCCGGGACATGCTAAAATGGGTGGCCAGTACTCGAGTGCTGTGCCGACCATAGTACCAGACGAGGAAGACCTCCCGGGGTGGGTCCCCAAGAATTATATTGAGAAAGTGGTGGCGATATACGACTACTACGCGGACAAAGACGACGAGCTGTCGTTCCAGGAGAGTGCCGTCATCTACGTCCTGAAGAAGAACGACGACGGCTGGTGGGAGGGAGTCATGGACGGAGTCACCGGCCTCTTCCCCGGGAACTACGTCGAGCCCTGCGTCTAG

Protein sequence:

>DPOGS201238-PA
MAELAALLQTDIPEGRNHLTDSHTNLERVAEYCEANYFQSENKRLALDSTKNYTTQSLASVAYQINTLAYNFLQLLELQTMQLAEMEGQMNHIAQTVAIHKEKVARREIGVLTANKVTNRQYKIIAPANPEKPIKYVRKSIDYTALDDIGHGARWSGSGGSGTPRGRRSGSAPNPLPAPTTKPPTPPAVRSTNAANTGTLGRGTLGKSSREYRTPPAVAPPQVVNCHQVGMVHPNQHQSPSNYSQQDLHSSMPPPPSPLIGSDGENSQHSMVLPGHAKMGGQYSSAVPTIVPDEEDLPGWVPKNYIEKVVAIYDYYADKDDELSFQESAVIYVLKKNDDGWWEGVMDGVTGLFPGNYVEPCV-
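
Consistency check (illustrative):

The lengths reported in this record can be cross-checked against each other. The sketch below assumes that the transcript DPOGS201238-TA consists of coding sequence only, which is suggested by the nucleotide sequence starting with ATG and ending with the stop codon TAG. The function names `expected_protein_length` and `domain_length` are hypothetical helpers introduced here for illustration and are not part of any database tool.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 1089          # length of DPOGS201238-TA
PROTEIN_AA = 362              # length of DPOGS201238-PA
INTERPRO_DOMAINS = {          # 1-based, inclusive residue ranges
    "IPR012849": (93, 159),
    "IPR001452": (293, 361),
}


def expected_protein_length(cds_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the stop codon.

    Assumes the sequence is coding-only (no UTRs) and ends in a stop codon.
    """
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


def domain_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive residue range."""
    return end - start + 1


if __name__ == "__main__":
    # 1089 bp / 3 = 363 codons; minus the stop codon gives 362 residues.
    print(expected_protein_length(TRANSCRIPT_BP) == PROTEIN_AA)

    # Every annotated domain should fall inside the protein.
    for acc, (start, end) in INTERPRO_DOMAINS.items():
        inside = 1 <= start <= end <= PROTEIN_AA
        print(acc, domain_length(start, end), "aa", "inside protein:", inside)
```

Under the stated assumption, the 1089 bp transcript accounts for exactly 362 residues plus one stop codon, which matches the trailing "-" at the end of the protein sequence.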