Monarch geneset OGS2.0

DPOGS207091
Transcript: DPOGS207091-TA (2034 bp)
Protein: DPOGS207091-PA (677 aa)
Genomic position: DPSCF300001 + 2839267-2841300
RNAseq coverage: 125x (Rank: top 57%)
Annotation
Heliconius: HMEL006144  2e-164  71.96%
Bombyx: BGIBMGA013055-TA  0.0  59.10%
Drosophila: (no entry)
EBI UniRef50: UniRef50_E2AAZ2  3e-18  42.66%  Ataxin-7-like protein 1 n=1 Tax=Camponotus floridanus RepID=E2AAZ2_CAMFO
NCBI RefSeq: XP_002405376.1  1e-17  64.52%  hypothetical protein IscW_ISCW005075 [Ixodes scapularis]
NCBI nr blastp: gi|340710739  3e-18  60.49%  PREDICTED: hypothetical protein LOC100649615 [Bombus terrestris]
NCBI nr blastx: gi|354498876  2e-22  26.30%  PREDICTED: ataxin-7-like protein 1-like [Cricetulus griseus]
Group
KEGG pathway: (no entry)
InterPro domain: [339-406] IPR013243  2.3e-23  SCA7 domain
Orthology group: MCL22317  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207091-TA
ATGTCCGCAGAATTCCCTTTCGCGACGAAGATAGTAAGTCCCTTAAAATGCATTGAAAAGCCATGGGATTTGTGGGTTAGCGAAATTGGACACCTGTCTCCTCGCCGCGATGACGACGCCATGGGCTTCCCGTCATCGGCGACGGATAGTCGGGGCCATGCCGTGGTCAGGCGCCCACACGAAGTTCATCCATACAAATCCACGGAGAGGATTCAGAAAGTTTATCGACGAGCCGGAGTCAATCGTCTTCAATACGAGAGCATGAGTCTCCATGGTCTTTTCCCACAAATGGATAATCTTAACGCGGCTGTGTGTCATATGTGTGGAATCACAGTGAAGTGTAGTGCAGCATACAGACATTTATTAGAGTCCCATACCGGTGCAGAACCAATACTCCCACCTCCTCCTCCCACCGTAAAGCTAAAATCCAATTTAAGTAGAAGCCGTCTTAAAAAAGATCCTTTGCCTCCTCTTCCTGTAGACATTCATCCAATAAAGTTGGAAACTTCTCCTGTTCATACAGCTGCAGCTCCTATCTCAAGGATAGTTTTGCCACAACCTGATTTACAATACTTGGAGGAAGCAAGCACTTCCTCAGCAATAACAGGGGAAGTACAAGTAAGCATGGAGAGTGGCGAGTTACCAGTTGTGAGTATACAGGACACTGAAGAATTACCTCTTGGTGAAAATATCACAGATGACATTTTTGCAATAATGAACTCTGAAGGAATTCAGAGTGCAGATGACATCACAAATGCCGCTGATTGGAAAAATATAATCAGGGATATAGGAAATATGCAAGAAATAAATTTTCCATCTACTGCAGATTCTATACAATCACAAGACTTATATTCAACAGTTGATACCTCATTCTCCAATTACACTTTAGCTGACTCAGATTTATCTAATTTGCAATTCCCTCCCAATGCATCACCCTTGCTGTCAACTGTGATAGACAATAATGTACTCAATGCTCAAGCTTTAAAACAGACGCCTACCACGCCGTCCACGAAAACATCACGAAAGTCAAGTAAAACGAAAGCAAACAATCAGAGAGAATACGACCCGAACAAGCATTGTGGGGTCGTGACAGCGGAGAACCCAAAGCCTTGCACGAGGTCACTGACGTGTAAGGCGCACGCGTTGTCCTTGCGACGCACAGTCGATGGCCGATCTAAGCCTTTTGACACACTGCTAGCCGAGCACCGAGCATCGCGAGAAGCTGCCGCCGTTGGAGCACCACTCGCCTCTCCCGCCTCGCTCCCACCACTCCTAGTCAATAGTTCGCTGGACCTCGGCAGTTTTAACGGACTGACCGCCGAGCAACAAGTCAACGACATATACGCGAGCTTGCTGAGCGTAGAGGATCCCATGCTGCCCGACACCTCCGGTATCACGTCTCTCTTGAGTCAGCCACTGTCGGACCCCTTCCTATTGCCGGACGACGCGGAGCCAGCGCCGGACGCAACTCCTCTCGTACCGCACCGCACGCGCGAGATCCCTCCCGTCCCAGCTACGTCCACCGCCTCCAGCTCCGCTCCATCGCTTGTTCCGGGCGACGTTTGTTGGTACGCCACCAGCCCTCGGCCACTCGCATTGTGTACATTCAACACATCGCACGCGGGCGGTGTCATTACACTGGGAAAGAAATTCGCAACCGTTCGGAATAACATAAAAACGTCGCTCTCGCGGTCAAGCAAGGCCACATGTGCCTCTAATAACTATTACGCACAGGGTATGTCACTTTCGAAGACCCTGCATATGAACAACGCCAACAAGACGAACAAACCCGAGGTTCGAAAGTTGATAGTGACGTGTAGTGCGCCCGGTGTTCAGAGGGAAATGCAACAGACTTTGAGTGATCTCTTTGGACCAGATGTGAGACATACACTGAACGGTCACATGGGCCACCCTAGCCACATCGGATTAGGTAGGAGCACGCGAACAACGCTAAAGAGCGCCAAGCGAGCCTCAGTTGCCGCGCTCGATCTTGGCTTCCC
TCTTGACCCATTGCTAGCCGACGAGAAGTGTTGA

Protein sequence:

>DPOGS207091-PA
MSAEFPFATKIVSPLKCIEKPWDLWVSEIGHLSPRRDDDAMGFPSSATDSRGHAVVRRPHEVHPYKSTERIQKVYRRAGVNRLQYESMSLHGLFPQMDNLNAAVCHMCGITVKCSAAYRHLLESHTGAEPILPPPPPTVKLKSNLSRSRLKKDPLPPLPVDIHPIKLETSPVHTAAAPISRIVLPQPDLQYLEEASTSSAITGEVQVSMESGELPVVSIQDTEELPLGENITDDIFAIMNSEGIQSADDITNAADWKNIIRDIGNMQEINFPSTADSIQSQDLYSTVDTSFSNYTLADSDLSNLQFPPNASPLLSTVIDNNVLNAQALKQTPTTPSTKTSRKSSKTKANNQREYDPNKHCGVVTAENPKPCTRSLTCKAHALSLRRTVDGRSKPFDTLLAEHRASREAAAVGAPLASPASLPPLLVNSSLDLGSFNGLTAEQQVNDIYASLLSVEDPMLPDTSGITSLLSQPLSDPFLLPDDAEPAPDATPLVPHRTREIPPVPATSTASSSAPSLVPGDVCWYATSPRPLALCTFNTSHAGGVITLGKKFATVRNNIKTSLSRSSKATCASNNYYAQGMSLSKTLHMNNANKTNKPEVRKLIVTCSAPGVQREMQQTLSDLFGPDVRHTLNGHMGHPSHIGLGRSTRTTLKSAKRASVAALDLGFPLDPLLADEKC-
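
Consistency check:

The lengths listed in this record can be cross-checked against each other with simple arithmetic. The sketch below is illustrative only: it uses the numbers exactly as printed in the record (2034 bp, 677 aa, positions 2839267-2841300) and does not read the sequences themselves. It assumes the genomic coordinates are 1-based and inclusive and that the transcript is a complete coding sequence ending in a stop codon; both are assumptions, as are the function names.

```python
# Values copied from the record; nothing is computed from the sequences.
TRANSCRIPT_BP = 2034
PROTEIN_AA = 677
GENOMIC_START = 2839267
GENOMIC_END = 2841300


def genomic_span(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # subtract the stop codon


span = genomic_span(GENOMIC_START, GENOMIC_END)
aa = expected_protein_length(TRANSCRIPT_BP)

print("genomic span (bp):", span)
print("span equals transcript length:", span == TRANSCRIPT_BP)
print("expected protein length (aa):", aa)
print("matches listed protein length:", aa == PROTEIN_AA)
```

Under these assumptions the genomic span equals the transcript length, which would be consistent with a gene model without introns, and the transcript length accounts for 677 residues plus one stop codon.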