Monarch geneset OGS2.0

DPOGS207276
TranscriptDPOGS207276-TA1599 bp
ProteinDPOGS207276-PA532 aa
Genomic positionDPSCF300008 - 11563-16731
RNAseq coverage125x (Rank: top 57%)
Annotation
HeliconiusHMEL0024062e-12667.40% 
BombyxBGIBMGA011749-TA1e-13889.38% 
DrosophilaKap-alpha1-PA9e-1021.89% 
EBI UniRef50UniRef50_E2A6U79e-15051.37%Sperm-associated antigen 6 n=7 Tax=Neoptera RepID=E2A6U7_CAMFO
NCBI RefSeqXP_002164433.17e-15050.88%PREDICTED: similar to predicted protein [Hydra magnipapillata]
NCBI nr blastpgi|3504207726e-15151.04%PREDICTED: sperm-associated antigen 6-like [Bombus impatiens]
NCBI nr blastxgi|3504207723e-14551.04%PREDICTED: sperm-associated antigen 6-like [Bombus impatiens]
Group
Gene OntologyGO:00054885.6e-66binding
GO:00055151.1e-07protein binding
KEGG pathwayafm:AFUA_5G135403e-08 
 K08332 (VAC8)maps-> Regulation of autophagy
InterPro domain[44-485] IPR0119895.6e-66Armadillo-like helical
[8-453] IPR0160243.8e-51Armadillo-type fold
[120-157] IPR0002251.1e-07Armadillo
Orthology groupMCL16260 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207276-TA
ATGAGTCATTCCATGAGCAGCCCGCGGAATATTCAACTCGTTTTTGAAACTTATCAAAAAGCCAGATTGGTTTTCGTTCAGACAATGGCTGAACTGGCTACAAGAGCAACTAATGTTAAATGTCTGGAGAGTGCTGGCGTTTTGGAATTGCTTCGGCCACTGTTGTATGATGCGTGTAGCACGGTTCGACAATGTGCGGTAGTGGCTGCAGCACGGCTTGCAGAACATGACGAAAATGTTGCTAGACAAATTCTCAACGGTGGCATGCTTACCATAACGCTGGAAAACCTCAACAAATATAATGTGTATTATAAACGGTCGGCGCTGTATTTAATGAGGGCGGTGGCAAAACATAACGAAGAATTAGCTTCGGCTATTGTAAGACTCGGTGCCTTAGAACATATTGTTTCTTGCTTAGAAGATTTCGACACGCAGGTCAAGGAAAATGCGGCTTGGGCATTAGGTTACATTGGAAAACATAGCGAACATTTATCAGGATTAGTCGTGGATGCTGGAACACTCCCGCTATTGGTTTTGGCCTTTCAAGAACCAGAAATGAGTTTGAAACAGATAGCTGCTGGTGCTTTGGTAGATTTAGCACAACACAAGCCTGAAGCCGTGGTAGACGCGGGAGCTATTTGTCATTTAGTGCGCGGCTTGGAAAACCAAGACCCTAAGCTAAAGCGTAGCACACTTTGCGCCCTGAGTGCCGTGGCGGGCACACGCGCGGCGCTGGCTGAGGCGGTAGTTGCTGGTGGAGCCCTACCGCCCGCCTTGCTTCACGCCGGCCACGATACTGCACCCGTTCGCCGCGCCGCCGCCTGTCTATTACGTGATATAGTCAAACATTCCGTAGACTTAGCCCAATTGGTGGTGAATACGGGTGGGTGTGGGCCCCTAGTGGAATTGTTGGTTGAGAACACCGGTGGGACGAGAGTTCCTGCTTGTATGGCACTTGGTTACATCGCGGGCCAGTCGGACCAACTGGCGATGGCTGTTATTGAATCTAAGGCTGTGGTGGTGCTGGTAGAAATCTTACAGAACAGTGAGGATGATGCAGAACTTTGTGCTGCAGCTTGGACACTGGGTCACATAGCAAAACATTCACCACAACATAGCCTTGCTGTAGCTGTTGCAAACGCATTACCTAGACTTTTACAATTATATACTAATCCAAAGTCATCCAGTGAGGTCAGGGCAAGAGCCGCCTGTGCTTTAAAACAATCATTGCAATGTTGTCTTCATCGACCAGCCTTAGAACCCTTACTGCATGCTGCTCCGGCTTGCATTCTTAAATATGTTTTAGCACAATATGCAAAGATATTACCCAATGATGCAAGAGCAAGAAGATTATTTGTTACGACAGGGGCTTTAAAGAAGATACAAGAAATTGACACCGTACCAGGAACTTCATTAAAGGAGTACATAAATATAATTAACAGCTGTTTCCCTGAAGAAATCGTAAGGTATTACACGCCAGGTTTCTCAGATTCATTGCTTGACAGAGTTGAAGCCTATACACCTCAGATCCCAGAACTATTTACAGATAGAGTTCCTAGTGATTGTCAAAGCGAAGTGACTATAGAAAACGGAAATTAA

Protein sequence:

>DPOGS207276-PA
MSHSMSSPRNIQLVFETYQKARLVFVQTMAELATRATNVKCLESAGVLELLRPLLYDACSTVRQCAVVAAARLAEHDENVARQILNGGMLTITLENLNKYNVYYKRSALYLMRAVAKHNEELASAIVRLGALEHIVSCLEDFDTQVKENAAWALGYIGKHSEHLSGLVVDAGTLPLLVLAFQEPEMSLKQIAAGALVDLAQHKPEAVVDAGAICHLVRGLENQDPKLKRSTLCALSAVAGTRAALAEAVVAGGALPPALLHAGHDTAPVRRAAACLLRDIVKHSVDLAQLVVNTGGCGPLVELLVENTGGTRVPACMALGYIAGQSDQLAMAVIESKAVVVLVEILQNSEDDAELCAAAWTLGHIAKHSPQHSLAVAVANALPRLLQLYTNPKSSSEVRARAACALKQSLQCCLHRPALEPLLHAAPACILKYVLAQYAKILPNDARARRLFVTTGALKKIQEIDTVPGTSLKEYINIINSCFPEEIVRYYTPGFSDSLLDRVEAYTPQIPELFTDRVPSDCQSEVTIENGN-