Monarch geneset OGS2.0

DPOGS208589
Transcript: DPOGS208589-TA  3645 bp
Protein: DPOGS208589-PA  1214 aa
Genomic position: DPSCF300052 - 969088-982700
RNAseq coverage: 373x (Rank: top 32%)
Annotation
Heliconius: HMEL015853  2e-110  70.52%
Bombyx: BGIBMGA005731-TA  0.0  78.55%
Drosophila: p120ctn-PA  3e-175  58.95%
EBI UniRef50: UniRef50_E2AHM6  0.0  58.11%  Catenin delta-2 n=9 Tax=Formicidae RepID=E2AHM6_CAMFO
NCBI RefSeq: XP_391862.3  0.0  61.00%  PREDICTED: similar to CG17484-PA.3 [Apis mellifera]
NCBI nr blastp: gi|270015599  0.0  53.50%  hypothetical protein TcasGA2_TC001464 [Tribolium castaneum]
NCBI nr blastx: gi|270015599  0.0  53.27%  hypothetical protein TcasGA2_TC001464 [Tribolium castaneum]
Group
Gene Ontology: GO:0005488  1.8e-90  binding
               GO:0005515  1.6e-06  protein binding
KEGG pathway: bta:515187  2e-84
 K05690 (CTNND1) maps-> Leukocyte transendothelial migration
    Adherens junction
InterPro domain: [553-734]  IPR011989  1.8e-90  Armadillo-like helical
                 [256-743]  IPR016024  1.3e-48  Armadillo-type fold
                 [293-366]  IPR019394  3.7e-23  Predicted transmembrane/coiled-coil 2 protein
                 [557-596]  IPR000225  1.6e-06  Armadillo
Orthology group: MCL11021  Multiple-copy universal gene

Nucleotide sequence:

>DPOGS208589-TA
ATGTTTGCGATATCCGTACATTTATATTGTAACGGGCCGGTAGAGGGCGCGTATGGGCCGCAGCATAGCCCTGTGTCGCGTTCCGACGCCTCCACCGAAGACGCTGAACTATCAGCTGCTCTCGCTCATCAGCACCATCTGGCTATGCAGGCGGGTGATTACCCTGTCGGTGGCGGTGTGGAGCCGGACTATGGCCAGTACGTGTCGTCACCAGCCGCACATTACAATCACATGGGACACCTCACCATGGCGCCATCACAGAAATACCCTCATGTGATGGAGACTGTGTCTGTGAGTAGCGGTTACGCGGGCGGAGTGGGGGCGGGTCACTACGGCACGTACGCTCACTACGGCGCCGCGTATCCAACTCAGCCCGCTTACTTAGTTGAACCTCAAGTACCCGCTTATATGGCACAGGAATTTGTAGAGGGCGGTGGTAGCGCTTCCCCTCGCAGTGCTTCACCCGGGGCCATACCTCCGCAACACAATATGCATCTGCAACAGAGGTACGACTCAGCGTCTCTAGAACAATTAGGTCGACACTACTGTGTGACGTCACCGCGCGGTGAATACGCCCCGGACGCGTACGGCTACCAGCACTACTCCGCCGCCTACGACACTACCCACCAGCCAACGGCGTTTAAGGATTCACAAAACGGTTTGAGTTTAGGCAGCACCGGAGGGCAGTCTATGTATGGCGATGAAGAGGAGCTACAAAAGCAAATGGCCAATATGGCATTAGTTCACGGTAGTGTTGGGGTGGGAGGTCGGGAGGAGGGTGGTGGTCTTCAGTGGCGAGACCCCAACCTGCCAGAGGTCATCGGTTTCCTGAACTCGCCGTCGGATGTCGTGAAGGCAAACGCCGCCGCTTACTTACAACACCTCACCTATATGGATGACCCTAACAAACAGAAAACTAGAAGCTTAGAGAACGTGAACGAGTACCTGAAGCTAGCAGCGAACGCTGACAAGCAGCAGCTGGCGAGGATCAAAGCTGTGTTCGAGAAGAAGAATCAGAAGAGTGCGCTCTGTATCGTACAGCTGCAGAAGAAGCTGGAGGGGTACAACAAGAGAATTAAGAGCTGGGAGATCAAGGAGTTGGTGACCGGCGTCATATGGAACATGTCTTCCTGCGAGGATCTCAAGCAGTCCATTATAGACGACGCGGCTCAAGTTATATTCAACAAGGTCATCATACATCACTCCGGCTGGCATCCGACAAACCCTGGGGACACGTACTGGTCCAACGTGTTCCGTAACGCGTCCGGTGTGCTCCGTAACGCGTCCTCGGCCGGGGAGTACGCTCGCAGACGTCTCCGTTCCCTGGCGGGGCTGGCCGAGGCTCTGCTGCACACAGTGCGGGTGGCGCTCGTCAAGAACGCTATAGGGACCAAGGTCGTCGAAAATTGCGTCTGTGTCCTAAGAAATCTGTCGTACAGGTGTCAGGAAATAGAGGACCCGCTGTATGACACTCGAGCGCCTCCGACACAGTCATCGGGACAGGCGAGGATTCAAGCGAGTGCGTCGAAAGGAGAAAATCTCGGATGTTTCGGTGGAAGTAAAAAGAAAAAAGAGGGTTCGTCATCAAATTCTACGAGTCCGCTCGGCAAGAGCGATCCTGAACCACAGACGGACACGAATACTACACAGTTAGGGTACAGCGTACCCAAAGGGACGGAGATGCTTTGGTCGCCTGAGGTGGTTCCTCTATACATGGCGTTACTCCAAACATGTTCCAATCCTGAGACCCTGGAGGCCGCGGCCGGAGCTCTACAAAACCTCGCAGCTTGTTACTGGCAACCCTCCATAGATATACGTGCAGCTGTTAGGAAAGAAAAAGGGTTACCAATTCTCGTAGAACTGTTACGTATGGAAGTCGACAGGGTGGTGTGTGCTGTGGCCACAGCTCTACGTAATTTGGCCATCGATCAACGCAACAAAGAACTCATAGGAAAATACGCGATGCGTGATTTAGTACAGAAATTACCCAGCGGTAATCA
ACAACACGATCAGGGTACATCGGACGACACCATAGCCGCGGTACTGGCTACATTAAACGAGGTTATAAAGAAGAGTGCCGAGTTCTCACGTTCGTTGTTAGAGGCGGGCGGGGTCGAGCGCTTGTTGAATCTGACCAAACAACGTCATCGCCACACGCCCAGAGTACTAAAGTTTGCTGGCCAAGTCCTAATGACGATGTGGTCTCACGTTGAGCTGCGTGAGGTGTATCGTAAGCACGGATGGCGTGAAGCAGATTTCCTCACACCGGCCAGGGCTGCTCAACCCCGGGCAGCATCTAACACTAGCGTTCATGTAAACGATGCACTCAGCTCGGCTCACTTTATGTTCACGAAGCATACACTGACTCATTTAATGTTCACGATTCAAATACTCGGCTCAGTTCATGTAAACGATGCACTCAGCTCGGCTCACTTTATGTTCACGAAGCATACACTGACTCATTTAATGTTCACGATTCAAATACTCGGCTCAGTTCATGTAAACGATGCACTCAGCTCGGCTCACTTTATGTTCACGAAGCATACACTGACTCATTTAATGTTCACGATTCAAATACTCGGCTCAGTTCATGTAAACGATGCACTCAGCTCGGCTCACTTTATGTTCACGAAGCATACACTGACTCATTTAATGTTCACGATTCAAATACTCGGCTCAGTTCATGTAAACGATGCACTCAGCTCGGCTCACTTTATGTTCACGAAGCATACACTGACTCATTTAATGTTCACGATTCAAATACTCGGCTCAGTTCATGTAAACGATGCACTCAGCTCGGCTCACTTTATGTTCACGAAGCATACACTGACTCATTTAATGTTCACGATTCAAATACTCGGCTCAGTTCATGTAAACGATGCACTCAGCTCGGCTCACTTTATGTTCACGAAGCATACACTGACTCATTTAATGTTCACGATTCAAATACTCGGCTCAGTTCATGTAAACGATGCACTCAGCTCGGCTCACTTTATGTTCACGAAGCATACACTGACTCATTTAATGTTCACGATTCAAATACTCGGCTCAGTTCATGTAAACGATGCACTCAGCTCGGCTCACTTTATGTTCACGAAGCATACACTGACTCATTTAATGTTCACGATTCAAATACTCGGCTCAGTTCATGTAAACGATGCACTCAGCTCGGCTCACTTTATGTTCACGAAGCATACACTGACTCATTTAATGTTCACGATTCAAATACTCGGCTCAGTTCATGTAAACGATGCACTCAGCTCGGCTCACTTTATGTTCACGAAGCATACACTGACTCATTTAATGTTCACGATTCAAATACTCGGCTCAGTTCATGTAAACGATGCACTCAGCTCGGCTCACTTTATGTTCACGAAGCATACACTGACTCATTTAATGTTCACGATTCAAATACTCGGCTCAGTTCATGTAAACGATGCACTCAGCTCGGCTCACTTTATGTTCACGAAGCATACACTGACTCATTTAATGTTCACGATTCAAATACTCGGCTCAGTTCATAACGATGACATGCCACCCATGGATGTGAATTACGTCGACGGTCTGGGTATGGGGAACCCTAACGGCCGTCCCATGAACCTACAGGGGGTCCAAGCCCAAATGCCCCCACCAAGCAGCCGTTAA

Protein sequence:

>DPOGS208589-PA
MFAISVHLYCNGPVEGAYGPQHSPVSRSDASTEDAELSAALAHQHHLAMQAGDYPVGGGVEPDYGQYVSSPAAHYNHMGHLTMAPSQKYPHVMETVSVSSGYAGGVGAGHYGTYAHYGAAYPTQPAYLVEPQVPAYMAQEFVEGGGSASPRSASPGAIPPQHNMHLQQRYDSASLEQLGRHYCVTSPRGEYAPDAYGYQHYSAAYDTTHQPTAFKDSQNGLSLGSTGGQSMYGDEEELQKQMANMALVHGSVGVGGREEGGGLQWRDPNLPEVIGFLNSPSDVVKANAAAYLQHLTYMDDPNKQKTRSLENVNEYLKLAANADKQQLARIKAVFEKKNQKSALCIVQLQKKLEGYNKRIKSWEIKELVTGVIWNMSSCEDLKQSIIDDAAQVIFNKVIIHHSGWHPTNPGDTYWSNVFRNASGVLRNASSAGEYARRRLRSLAGLAEALLHTVRVALVKNAIGTKVVENCVCVLRNLSYRCQEIEDPLYDTRAPPTQSSGQARIQASASKGENLGCFGGSKKKKEGSSSNSTSPLGKSDPEPQTDTNTTQLGYSVPKGTEMLWSPEVVPLYMALLQTCSNPETLEAAAGALQNLAACYWQPSIDIRAAVRKEKGLPILVELLRMEVDRVVCAVATALRNLAIDQRNKELIGKYAMRDLVQKLPSGNQQHDQGTSDDTIAAVLATLNEVIKKSAEFSRSLLEAGGVERLLNLTKQRHRHTPRVLKFAGQVLMTMWSHVELREVYRKHGWREADFLTPARAAQPRAASNTSVHVNDALSSAHFMFTKHTLTHLMFTIQILGSVHVNDALSSAHFMFTKHTLTHLMFTIQILGSVHVNDALSSAHFMFTKHTLTHLMFTIQILGSVHVNDALSSAHFMFTKHTLTHLMFTIQILGSVHVNDALSSAHFMFTKHTLTHLMFTIQILGSVHVNDALSSAHFMFTKHTLTHLMFTIQILGSVHVNDALSSAHFMFTKHTLTHLMFTIQILGSVHVNDALSSAHFMFTKHTLTHLMFTIQILGSVHVNDALSSAHFMFTKHTLTHLMFTIQILGSVHVNDALSSAHFMFTKHTLTHLMFTIQILGSVHVNDALSSAHFMFTKHTLTHLMFTIQILGSVHVNDALSSAHFMFTKHTLTHLMFTIQILGSVHVNDALSSAHFMFTKHTLTHLMFTIQILGSVHNDDMPPMDVNYVDGLGMGNPNGRPMNLQGVQAQMPPPSSR-