Monarch geneset OGS2.0

DPOGS204586
Transcript: DPOGS204586-TA; 1317 bp
Protein: DPOGS204586-PA; 438 aa
Genomic position: DPSCF300400 - 81746-85418
RNAseq coverage: 419x (Rank: top 29%)
Annotation
Heliconius: HMEL008392; 0.0; 88.86%
Bombyx: BGIBMGA001438-TA; 0.0; 86.14%
Drosophila: Mfap1-PA; 3e-90; 57.43%
EBI UniRef50: UniRef50_Q16U12; 2e-102; 49.90%; Microfibril-associated protein n=3 Tax=Culicidae RepID=Q16U12_AEDAE
NCBI RefSeq: XP_395869.3; 8e-126; 56.92%; PREDICTED: similar to CG1017-PA [Apis mellifera]
NCBI nr blastp: gi|350412896; 9e-125; 57.46%; PREDICTED: microfibrillar-associated protein 1-like [Bombus impatiens]
NCBI nr blastx: gi|91077510; 5e-138; 58.31%; PREDICTED: similar to microfibrillar-associated protein 1 [Tribolium castaneum]
Group
Gene Ontology: GO:0005576; 1.1e-166; extracellular region
KEGG pathway:
InterPro domain: [4-433] IPR009730; 1.1e-166; Micro-fibrillar-associated 1, C-terminal
Orthology group: MCL12158; Single-copy universal gene

Nucleotide sequence:

>DPOGS204586-TA
ATGAATGTATTACCAGCACAACCTATAGGTATTCAGAGTACTGCTGGCGCTGTACCCGTCCGCAATGAAAAAGGTGAAATATCTATGCAAAAAGTTAAGGTGCAGAGGTATATATCAGGAAAGAAACCAGATTATGCTCAAGGAGTTACCTCGTCTGAAGAATCAGAAGCTGAAGATTTTATAGAGCAACAAAGACCTGAACGTCGACACCAGATTGCACAGATTATTAGTCAGAAGGATGATATACAAAGTCATTCAGAGAATGAAGTTGACGACCCTCGTCTTCGCCGTTTACGTGTTGCACATTCTCCGCGTCGTGCTGAACACAAACCAGAGGTTATTGATGCTGAGCCAGAACCTGAATCAGAATCAAGTGAAGAGAGTAAACACAGTTCAGAAAGTGAAGATGAACTTGATGAAGAAGAAATTGAAAGGAGAAGACAAGCTGTTAGAGCCAAACTAGCAGCTAGAGAGGCAGAAAAAGAAGTATTAGGAAGAGAAGATGATGAGGAAATGTTAGATGGTGACAAAGAGGAAAGTGGTTCGTCAGATACTGAATATACTGATAGCGAAGAAGACACAGGGCCAAGAGTGAAGCCGGTGTTTGTGAGGGCGTCAGAAAGAATGACAGTAGCAGAGAGAGAACGTAAACTTAAACAGCAGAAGAAAGAAGATTCTGAGGCAAGAAAGGAGAAGGAGGAGAGAAGACGGGAAGCATTGAAATTAGTTGAAGAAACTATTCGTGCTGAACAAAGAAATACACAGTCAGAATACAAGGAGGGTAATATCAATGACGTGTGCACTGACGATGAGAATGATGAGTTGGAATATGAAGCGTGGAAATTAAGAGAGATGAAGAGAATAAAACGTGACAAGGAGGAACGGGAAGCTGCGGAGAAAGAGTTGTTAGCTATAGAGCGTATGAGAAACATGACCGAGGAAGAGCGTCGCGTCGAACTGCGACTAAATCCCAAGTTGGTCACAAACAAGTCCGTCAAGGGCAAATATAAGTTCCTTCAGAAGTATTATCACAGAGGTGCTTTCTATTTGGATAAGGAAGAAGATGTATTCAAACAAGATTTCTCGGGACCAACACTGGATGATCACTTTGATAAGACTGTTCTACCTAAGGTGATGCAAGTGAAGAAGTTTGGTAGATCGGGTCGTACAAAGTACACTCATCTAGTCGACCAAGATACAACCGAGTTCGACTCCGCGTGGAGCAATGAAGGCACAGCGGCCAGGCTTACCAACTTTAGAGGCGGAATGAAACAACAATTCGAAAAGCCATCCGCCAAATCGAAACATAACTCTTGA

Protein sequence:

>DPOGS204586-PA
MNVLPAQPIGIQSTAGAVPVRNEKGEISMQKVKVQRYISGKKPDYAQGVTSSEESEAEDFIEQQRPERRHQIAQIISQKDDIQSHSENEVDDPRLRRLRVAHSPRRAEHKPEVIDAEPEPESESSEESKHSSESEDELDEEEIERRRQAVRAKLAAREAEKEVLGREDDEEMLDGDKEESGSSDTEYTDSEEDTGPRVKPVFVRASERMTVAERERKLKQQKKEDSEARKEKEERRREALKLVEETIRAEQRNTQSEYKEGNINDVCTDDENDELEYEAWKLREMKRIKRDKEEREAAEKELLAIERMRNMTEEERRVELRLNPKLVTNKSVKGKYKFLQKYYHRGAFYLDKEEDVFKQDFSGPTLDDHFDKTVLPKVMQVKKFGRSGRTKYTHLVDQDTTEFDSAWSNEGTAARLTNFRGGMKQQFEKPSAKSKHNS-
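
The transcript and protein records above can be cross-checked against each other. The sketch below is not part of the database record. It assumes the transcript is a coding sequence in frame from its first base and that the standard genetic code applies, and it writes the stop codon as "-" to match the trailing character of the protein sequence. The names translate and lengths_consistent are hypothetical helpers introduced only for this illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (assumption: the standard nuclear code applies to this gene).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_char="-"):
    """Translate an in-frame coding sequence; stop codons become stop_char."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_char if aa == "*" else aa)
    return "".join(residues)


def lengths_consistent(transcript_bp, protein_aa):
    """True if the transcript is exactly protein_aa codons plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


# First ten codons and last five codons of DPOGS204586-TA, copied from above.
print(translate("ATGAATGTATTACCAGCACAACCTATAGGT"))
print(translate("AAACATAACTCTTGA"))
print(lengths_consistent(1317, 438))
```

Passing the full DPOGS204586-TA sequence to translate and comparing the result with DPOGS204586-PA extends the same check to the whole record.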