Monarch geneset OGS2.0

DPOGS202371
TranscriptDPOGS202371-TA1074 bp
ProteinDPOGS202371-PA357 aa
Genomic positionDPSCF300104 + 145817-150246
RNAseq coverage131x (Rank: top 56%)
Annotation
HeliconiusHMEL0171671e-8174.61% 
BombyxBGIBMGA013791-TA2e-9556.02% 
DrosophilaNup62-PA1e-5553.07% 
EBI UniRef50UniRef50_F4X1821e-6440.99%Nuclear pore glycoprotein p62 n=7 Tax=Formicidae RepID=F4X182_ACREC
NCBI RefSeqXP_001606493.16e-6156.70%PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis]
NCBI nr blastpgi|3320192455e-6440.99%Nuclear pore glycoprotein p62 [Acromyrmex echinatior]
NCBI nr blastxgi|1565498021e-6944.13%PREDICTED: nuclear pore glycoprotein p62-like [Nasonia vitripennis]
Group
Gene OntologyGO:00056433.6e-70nuclear pore
GO:00170563.6e-70structural constituent of nuclear pore
KEGG pathway 
InterPro domain[2-328] IPR0077583.6e-70Nucleoporin, NSP1-like, C-terminal
Orthology groupMCL11840 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202371-TA
ATGTTTGGCCAATCACAGCCATCACAGACGCCAAGTTTCGGCCAATCCGGCCAGACCTCTACATCACAGACAACTCAAGCCATAAGCTTCAGTACAAGTCTTTCTACGGCACAAACCACTAGTGCATCGGGACTATCGACTAGCGGGTTAGGAATATCGACCAGTGGTTTCGGGCTCACAACTAGTGGGACGGGAATCTCGACGAGCGGTTTGGGAATATCATCGAGTTTAGGAGTGTCGACTGCAACAGCGGGAATCACTACTAGCAGCTTGGGGATATCCACAAGCGCTCTGGGAACCACTACCACGAGTGGTATATCTCTCGCCGGTGGCTTATCATTCGCCAAGACAACAACAGCAACTTCAACGACCCCGTCCGGTATAACAGCGTTGGGTAAAACGGCTGCAAGTGTGATGCCTCCTACTACAGTTGTTGTTGTTCTCATCGCCAGATCCCAAGCCCCTCCAGCGGCTATAACTTCAATAAATTTCGCGGAACTCCAGGAAAATATCAATAAGTGGAGTTTGTCTTTGGAGGAACAGGAGAGAGTTTTCTACAGACAAGCGACTACACTCAACGCCTGGGACAGACTATCGGCGAGTAACGGTGAAAAGATCTTAGAGCTGAACGAGGCTATAGAGACACTCAAGAACGAGCAACAGAGTTTGGAACACGAACTGGACTTTGTTCTAGCACAACAGAAGGAGCTAGAAGATATACTCACACCGTTAGAGAAGAATATAGTGGATGAAAGAGTTAGAGATCCGGAGAGAGAACATATGTATTCATTAGCGGAGACTTTGGATACACAGTTACGGCAAATGTCTGAGGACTTGAAAGAAGTCATCGAACATCTCAATGAGACCAATAGAAATCCAGACAATAATGATCCAATCGTCCAGATAGGTCGGATCCTCAACGCTCACATGTCGTCCATGCAATGGATCGACAGCTCCATAGCTCAGATAACGACCAAGCTGGACGGGCTCCGGGCGACACACGACGCGCTCAGGAAAGACAACCAGAGTGTGAAGGTGGAGCACGACCAGTCGCTTGTGTATGATGTGGCATAA

Protein sequence:

>DPOGS202371-PA
MFGQSQPSQTPSFGQSGQTSTSQTTQAISFSTSLSTAQTTSASGLSTSGLGISTSGFGLTTSGTGISTSGLGISSSLGVSTATAGITTSSLGISTSALGTTTTSGISLAGGLSFAKTTTATSTTPSGITALGKTAASVMPPTTVVVVLIARSQAPPAAITSINFAELQENINKWSLSLEEQERVFYRQATTLNAWDRLSASNGEKILELNEAIETLKNEQQSLEHELDFVLAQQKELEDILTPLEKNIVDERVRDPEREHMYSLAETLDTQLRQMSEDLKEVIEHLNETNRNPDNNDPIVQIGRILNAHMSSMQWIDSSIAQITTKLDGLRATHDALRKDNQSVKVEHDQSLVYDVA-