Monarch geneset OGS2.0

DPOGS213797
TranscriptDPOGS213797-TA1902 bp
ProteinDPOGS213797-PA633 aa
Genomic positionDPSCF300106 - 171533-175031
RNAseq coverage355x (Rank: top 33%)
Annotation
HeliconiusHMEL0161580.085.41% 
BombyxBGIBMGA006782-TA0.081.31% 
DrosophilaCG11178-PA1e-13142.15% 
EBI UniRef50UniRef50_E2BA716e-14945.83%Uncharacterized protein KIAA0241 n=7 Tax=Formicidae RepID=E2BA71_HARSA
NCBI RefSeqXP_624093.18e-15348.20%PREDICTED: similar to CG11178-PB, isoform B [Apis mellifera]
NCBI nr blastpgi|1892419221e-15048.38%PREDICTED: similar to rCG52383 [Tribolium castaneum]
NCBI nr blastxgi|1892419226e-15246.66%PREDICTED: similar to rCG52383 [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[8-462] IPR0183079.8e-112Late secretory pathway protein AVL9
Orthology groupMCL13671 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213797-TA
ATGTCGTATATCAATGAACCAGTCTTAAACATTATAGTGGTTGGTTTCCATCATAAAAAAGGTTGCCAGGTCGAGCATTGTTACCCAGAGTTAATTCCTGGTCATCCATCTGAATTACCAGCAGCTTGGCGATATCTTCCAGCTTTAGCTCTGCCAGATGGCTCGCATAACTACTTATCAGACACAATATTCTTCAGCTTACCCGGATTGACTGAACCTGCTCACACTGTTTATGGTATATCATGTTTTCGACAAATACCCATAGAGCAAGTTGCTCAGAAAACGGAAGATATGACCAGAAGTTCTGTCCAAAAGAGCGTATGTGTTATTTGCCGAGCTCCATTGTTTGGTCGTCTGTCTGTAAAGGTGGAACTTGTGGTGAGGGCTTGGTTCCTCCAGGGTGACTTTTCCCAGACCAAGCTCTTAGAAGATGCTTTTAAACATCTCAATAGTTGTCCTGTTCAAATTGATCAGACCCTTGAAGGTTTATCAGTGCAGAAGTTAGTAGAAAACTGGAGACATAAAGCCTTATTGTTGTTTAAACTGTTACTGTTACGTCAGAAGGTTCTTATTTATGGATCGCCGGCCGGTTCACTGTCAACTGCATTGTTGACTCTTGTCTCTCTTTTACCGCAATGTCTCGAGTATGGATTGACTAAATCTGCTAATGTTGTTTTGTCTAGACCACTTTCACCAATACCTACTGTGATACCAGAAGAAAAAACTACAGAAGACAGCATTGATGATGCTTTAGATGTTGTAGAGGACGCATTAAATGGGACAAATCAGCCAGATGATGAATTGTCTCCAAAAGAAACTGGAAATTTGTTTGAGGAAAAGGACAGAGTCAGCCGTCAAAGTTTTGATGAATCAATCCTATCTGATGTTGTCGACAGACAAGAACTGTCTGCTGGAAGGGAAAAATGTCACAGTATAGGAGAAAAATATAAGGTACAAAAACCACTGTCCGAAGCACAGCAAAGCCCTACAATGGCTAGAGACATGAGTGTCGATGGTCTTTACAATCTGACGGGACAATTAGATCAAACTGAATGTGGTTTTCCCTTGAGCTTGTTTGAGGATGGCTACATATGTTTACCGTACTTGTCACTGCAATATTTAGATCTGCTGTCAGACACATCTGTGAAGGGTTTTGTTGTTGGAGCTTCGAATGTTCTTTTTAAACAGAAGAGGCAGCTATTTGACGTCTTGGTCGAGCTGAACGAAATGAGGATAGAAACAGCTGACATGAGTTTAAGACGTCAGCTGGTTCTCAATACTGAAGATCTTAGGTTTGCGGATCACGTGGTCCGCCACGCCTCCTCCCAGGGCGACACGTGGATACGGGATCAATTTGCTAGTTATTTAATATACTTATTACGAACTTCTTTATTACCAGAAGGGAGTCGGGAGATGGAATCTTATAATGCTCAGTTCATGACAGCCTTCAAGGCGACACCAGCCTACGACAAGTGGTTGAAAGCCACAAACAATGCCAACATAGAACCCTTCATGAACCTGACTCCTATGCATCCGTTTTCTGGACAACTATCTGTCGCTGACATGAAGTTAAAATTGGCACATACTATGTCGACTACGGAGGGCGGTCGTAAGGTGACGGCGGCCGTGGCGAGCACGGGGCGGGCTGTGGCGAGCACGTCGCGGGCCGTGGGCGGAGCGCTGTCACATGCTCGCGGCGCGCTATCGGGCTGGTGGAGCGCGCTCACAGCGCCCACGACCGCGCCCACCGACCCGGACGTGGTCGACTTGAGCGACGTGAGCGATGCTGTAGAAGCAGATGACGCGACGGGTCAACTGGAAGACGACAACCGCCGCCTCCCGGATCCTCCCGACAAGCGACTCGACCCCGCCATCGACGTCACCAACATAAGAGTCATTTAA

Protein sequence:

>DPOGS213797-PA
MSYINEPVLNIIVVGFHHKKGCQVEHCYPELIPGHPSELPAAWRYLPALALPDGSHNYLSDTIFFSLPGLTEPAHTVYGISCFRQIPIEQVAQKTEDMTRSSVQKSVCVICRAPLFGRLSVKVELVVRAWFLQGDFSQTKLLEDAFKHLNSCPVQIDQTLEGLSVQKLVENWRHKALLLFKLLLLRQKVLIYGSPAGSLSTALLTLVSLLPQCLEYGLTKSANVVLSRPLSPIPTVIPEEKTTEDSIDDALDVVEDALNGTNQPDDELSPKETGNLFEEKDRVSRQSFDESILSDVVDRQELSAGREKCHSIGEKYKVQKPLSEAQQSPTMARDMSVDGLYNLTGQLDQTECGFPLSLFEDGYICLPYLSLQYLDLLSDTSVKGFVVGASNVLFKQKRQLFDVLVELNEMRIETADMSLRRQLVLNTEDLRFADHVVRHASSQGDTWIRDQFASYLIYLLRTSLLPEGSREMESYNAQFMTAFKATPAYDKWLKATNNANIEPFMNLTPMHPFSGQLSVADMKLKLAHTMSTTEGGRKVTAAVASTGRAVASTSRAVGGALSHARGALSGWWSALTAPTTAPTDPDVVDLSDVSDAVEADDATGQLEDDNRRLPDPPDKRLDPAIDVTNIRVI-