Monarch geneset OGS2.0

DPOGS200337
TranscriptDPOGS200337-TA3006 bp
ProteinDPOGS200337-PA1001 aa
Genomic positionDPSCF300026 + 399093-404904
RNAseq coverage112x (Rank: top 59%)
Annotation
HeliconiusHMEL0057940.076.19% 
BombyxBGIBMGA005632-TA0.072.80% 
Drosophilal(2)41Ab-PA5e-4225.03% 
EBI UniRef50UniRef50_D0ABA40.078.55%Putative cohesin subunit n=1 Tax=Heliconius melpomene RepID=D0ABA4_9NEOP
NCBI RefSeqXP_971171.26e-7728.11%PREDICTED: similar to cohesin-subunit, putative [Tribolium castaneum]
NCBI nr blastpgi|2613359190.078.55%putative cohesin subunit [Heliconius melpomene]
NCBI nr blastxgi|2613359190.078.55%putative cohesin subunit [Heliconius melpomene]
Group
KEGG pathway 
Orthology groupMCL12738 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200337-TA
ATGGAAGAAATGGGGAAATCCTTAGAGCCTTGTGCATGCGAGGAGGACAGAGAAATCGAGGAGAAAGTGTTGGTACCCGACTGCAGATGTCCAAGAAATGTCCATGATGTTGTTCTACATGATGAAACATGCTTGCTTTACAAGAACGAGAAGATGATATATCCAAGGGATGACGCCTGTGAATGTACCAGCCTGGCGAGCGGTCCACGTGAATGCGAGTGCTGCAGATGCCCTCTGGACCAATGCACTTGCAACAGAGCTGGGAGAGAAGCTTTTGATCAGTGGCGTAGTAAGAGGGACATTGGAGCAATGCCAGCTGTGTTGGAGTCGACTCACCCACTGATGAAGCGGTTCCAGGAAACTTTGAAGAGATTTTTAGAAAAAGAGAATGCGATTGCCGAAGACGAAATCACTAGATTGCGCGACGAACTACGACTCAGCAAACAGAAGTATGATCAGGATCTTGGTTCTATTTATAGGAATGATCATGACACAAACGCACAAAGATCGTTAATTGAAGAGTATGAAGCGACATTGGCCAAAAAAACAAAAGAGCGCCTGGATGAGGAACACCGAGCCCGGGAATCTAATGATAAATATAAAATAGCCAAAGAAAAGGTTGAAAAAAGTATAATAACTGAACGAGAAGCAACAGAAGAGTTAGAAGCACTCACTACATTATGTCGGCAGCTTGAAGCGTGGCGTGAGGAGACCGAATCAGATCTCACCGTTAGCCAGCGCATGTCGGACAAAATGAGGGCTGAGAAAAAGTCATTGGCAGACGAGAAAAGACAACTTGATGTCATCATTTATAGTCTCAGTAATGAAGTTTGGAAGTTGGAATCCAAACTGGAGATGTTCCAGAAGCAGATGGAAGTTAAGAATGTTGAAATGGAAAAAGTTAATGATAAGGTGACTGCTTATGCGGCAGAGCTAGAGGATCTCGAACTCGAGAAACGTAGGCTTGTTAGTTTGTGGAACTCAGTTCTTGTTAATATACAACAACGGGATAGGGTTTACGATTCTGTGCGGGATGATTACAGAGCTCTGCAAGAGAATTACCGAACGCTGTTGAACAATTTAGAGATCACAAAAAAAGTAGCAATGGAGGAGATGAATAAAGGAAAAGAATTGGCAATGAACAAGGATAAACTAACTTATGACGTCAACAACGCAACGAAAATGTATGAAGCCGAGGACGCTAAACGTCTTTTCCTAGAAACTCAAATTTCCGAATTAGCTGAATCAATCGAAATGACCGAGCGCGATGAGGAGCTTATCAAATCAGAAAATCAAACGATGCAAAATATTCTTAAAAGCACTACTAAAGAAATATGCCGTAGAAATGAACAAAAAGTGAAACTAGAAAATGAAATTTTAACAAATCTTCAAGAATGTCTTCTAAATGATAAAGCCGTGGAATCAATGGCCAATGGAATAAAAAAATTAAGAGAAATGTCGAGAAAGCAGGAAATATCTTTAATGTCCATGGAGAATCAGCATGCGAAGATAATGCTGGATATAGAAATGCACCGTAACAGACAAGCACGGAATAATGCACTCTTGGAAGAAAATCTCGGGAAAGTGAGAGAAAGAGAAAGGGAAATCGAGCAGTTGGAAGAGAAGTACGAAAAAAAGATGCTGGTTATTACAAGAAAGCAGCGAGAGCTTGATATTACTATAAAAAAGTTTAACGCGCTTAAAGAAATATTTGATATGAAGTCTCCACAAGAGCGTCGCATAGAAGAATTGGAGCAGCAGATACGAGGGATGAGGGAGCGTACAGAACAACTGCAGCACGAATGGCTCCGACTACAGGGACACGTTGTGAAATTGACAGCACATCATCATAAGATCGTTTCGGATATCAATCTTATAAATAAACAAATTCAAATTTGCGAGCAAAAAACGATGCGTATCGAGGCGGAGGGCGAACGTGTGTTATTGGAGCAGGCGCGTACGGAAAGGAACCTTCGTGAACTGCGCGGTCGTCTGGAGTTGTTGGAGCGAACACGCAAAGAGGCAACAGAGAAGAATCAGAGTGCCCAACGAGCTAACTTAGCCATTACTCATGAATATTCTGCAAATCTTAAGGACGCTGAAATGGAAATAATTCAAATTGAAGAGGAAATTGAAGCCTTTGAAAAAGAAAAGATGAATCTCGCTCAGGAGTTGGATCGCATTCAAAGAGAAGCACTCATATGGCAGAGGAAGGGCATTTTAGCAGTAGAGTTGAAGAAAAACATTCAAAATGCTAAATCTGCGGCTGGAGAAATCGGTCAGATGAGGGCTGAAATTCATAGAATGGAAGTGCGTCGTGAACAATTACGAAAAACAGGAGAAAAGTTATCAGAGGACTTGGCTTTGTATGTGACGCGTCGTGAAACAGCAATGGAAAAGAGCCGTGCATCAGCAGCCGTAGAGAAGGCGCATGGAAACGCTGGTCACACTTCCCAGTCAAATTACCATCACAAGCTCAGGCTGGCTAAAGCAGACGTGGCTAGAGTTACAAAGGATTTAGCGGAAGCTAAGGCTCATATGGAGAAACTGCAAATAGAGCAAGATCGGCTTGAGCGTGAAGTAGCGGAAACTAGCGCCGCCAACGCTAGACTCGAAGAACACGTGGCCAAGTTGTTGAAGGAATATGGAGAGGCAGAAAGGCAGAAACAATATCTTCTAGAGCGTGTTGTCCGTAGCCAACGTCTTGGAAGCGAATTGGCTACTGTTATAAAAAGGCAATCCCTTCGCGTGAAGAAACCCAGGTCAGCTGTACTACTGGAATATAAACAGAGTCGGGAGTTAAATAAACATCTAAAGAGTATTGTTGATACCCTGGTTGAGGACTATCCACATCTAGCTGACAGGTTGGAAGCCGTGTCTAACACATTGAATATCCACTCGCCAGACGACTCACCAAGACTAATAGACGATCCCTGTCTCTGTTTAGAAGTAAAAGAAGAAGATCCCAATATACCAGAAAAGCTCGAAAAAACTGCAGAGGAATAG

Protein sequence:

>DPOGS200337-PA
MEEMGKSLEPCACEEDREIEEKVLVPDCRCPRNVHDVVLHDETCLLYKNEKMIYPRDDACECTSLASGPRECECCRCPLDQCTCNRAGREAFDQWRSKRDIGAMPAVLESTHPLMKRFQETLKRFLEKENAIAEDEITRLRDELRLSKQKYDQDLGSIYRNDHDTNAQRSLIEEYEATLAKKTKERLDEEHRARESNDKYKIAKEKVEKSIITEREATEELEALTTLCRQLEAWREETESDLTVSQRMSDKMRAEKKSLADEKRQLDVIIYSLSNEVWKLESKLEMFQKQMEVKNVEMEKVNDKVTAYAAELEDLELEKRRLVSLWNSVLVNIQQRDRVYDSVRDDYRALQENYRTLLNNLEITKKVAMEEMNKGKELAMNKDKLTYDVNNATKMYEAEDAKRLFLETQISELAESIEMTERDEELIKSENQTMQNILKSTTKEICRRNEQKVKLENEILTNLQECLLNDKAVESMANGIKKLREMSRKQEISLMSMENQHAKIMLDIEMHRNRQARNNALLEENLGKVREREREIEQLEEKYEKKMLVITRKQRELDITIKKFNALKEIFDMKSPQERRIEELEQQIRGMRERTEQLQHEWLRLQGHVVKLTAHHHKIVSDINLINKQIQICEQKTMRIEAEGERVLLEQARTERNLRELRGRLELLERTRKEATEKNQSAQRANLAITHEYSANLKDAEMEIIQIEEEIEAFEKEKMNLAQELDRIQREALIWQRKGILAVELKKNIQNAKSAAGEIGQMRAEIHRMEVRREQLRKTGEKLSEDLALYVTRRETAMEKSRASAAVEKAHGNAGHTSQSNYHHKLRLAKADVARVTKDLAEAKAHMEKLQIEQDRLEREVAETSAANARLEEHVAKLLKEYGEAERQKQYLLERVVRSQRLGSELATVIKRQSLRVKKPRSAVLLEYKQSRELNKHLKSIVDTLVEDYPHLADRLEAVSNTLNIHSPDDSPRLIDDPCLCLEVKEEDPNIPEKLEKTAEE-