Monarch geneset OGS2.0

DPOGS214284
Transcript          DPOGS214284-TA    1275 bp
Protein             DPOGS214284-PA    424 aa
Genomic position    DPSCF300014 + 1938062-1939500
RNAseq coverage     75x (Rank: top 65%)
Annotation
Heliconius        HMEL011412          3e-112    51.09%
Bombyx            BGIBMGA005996-TA    8e-48     53.01%
Drosophila        shu-PA              4e-30     30.92%
EBI UniRef50      UniRef50_A0NFE6     9e-37     29.31%    AGAP011458-PA (Fragment) n=1 Tax=Anopheles gambiae RepID=A0NFE6_ANOGA
NCBI RefSeq       XP_001237981.2      2e-37     29.31%    AGAP011458-PA [Anopheles gambiae str. PEST]
NCBI nr blastp    gi|158297652        3e-36     29.31%    AGAP011458-PA [Anopheles gambiae str. PEST]
NCBI nr blastx    gi|158297652        9e-37     29.28%    AGAP011458-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology     GO:0005488    4.1e-13    binding
                  GO:0006457    9e-09      protein folding
KEGG pathway 
InterPro domain    [88-326]     IPR023566    3.5e-42    Peptidyl-prolyl cis-trans isomerase, FKBP-type
                   [196-326]    IPR011990    4.1e-13    Tetratricopeptide-like helical
                   [84-170]     IPR001179    9e-09      Peptidyl-prolyl cis-trans isomerase, FKBP-type, domain
Orthology group    MCL25190     Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214284-TA
ATGATCATTAATGTCTTCTCTAAATATTTATGTAGCAAACTTGTAACAACTGGGTCCATACTTCATATAAATGCAGATTATGACGATCAAGAATGTTACACACTTGAGTCTAAACCTTTTAAGGGGAATGATTATATTGGAGAGCCAGTAAAGTCGTTTGATATATTTGAGAAAGAATTACAACCTGTAGATACACTTGGCCTTGTTCAGAAAAAGATTTTAGAAGAAGGAGGAGGTTTGGCTCTAAGCAAGGATTGTACTGTCAGTGTTGCATATGCAGGATATTGGGAAAATGAATTTGAACCTTATGATTTTACTAAACTTGATAAACCATTGGTAGTAAATTTAAATGACAATGGACTTTTACCAGGTGTTCAAATTGCCATTGAATCCATGCTAGTTGGTGAAATGTCAGTATTTTTGTTATCTTATGAAGTCATGTATGGCGATATGGGTGTTCCACCGAAAATCAAACCTAAGGCTAACTGCGTCTTCTATTTGAAACTTATAAAAAGCATTATTACACCTAAAGATGGGAAAATTGACTTCTCAGAACCAAATATATTTGAAAGGGTTCTCCATGAAGTAAAACTGTTGTATAGCTCAGGGGTCGTATTGCATAAATCAAGAAATTATATGGCTGCAATACAATCTTTCAGGAAGGCTGTTAATATGTTGCATAGATGTCGTCTTGCCAATGAAAGTGAAGAAGCTATACAAGAAAAGTTCCTAAAAAAATTGTATATAAATTTAGCAGTGTGCTATAATGAAGTTAAGCAACCATTGAAAACATGCATAGTTTGTAACGAACTAAATAGGTTAAGAAATCTATGGAATAATGAAAAGGTCTTATATCAAAATGCTAAGGCCTTAAGGATGATTGGACAATTTGATGCTGCTGAGAAGAAACTTAGACGAGCCTTGCGTTTCTCTCCTGACAATGATAGAATTTTAGAGGAATTGAATTTGCTTCAAAAGACTAGAGATTCTTGCAATCAAAGCCGTCTCATAGATAGTAATGTGTCAAGTAATTCTAACAATAGTGCAAATGATCAGTTTAGAAATGAAGTAGATAGTCTAATAAAAAATTTTAAGGAAAATGTAAATCTTTGTAAGCTTACTCTGCCTCCAGGTCTTAATGCAGCTGAAATTGGATACATTAAGGAGGTTTGTGTCAAAGAGAATTTGTATTGCAATAAATTACAAAAAGACTACTTGTTGGATAAAGAAGAGGATAAAGTGCCCTTAGAATCCAAAATAGATCTATTCATATAA

Protein sequence:

>DPOGS214284-PA
MIINVFSKYLCSKLVTTGSILHINADYDDQECYTLESKPFKGNDYIGEPVKSFDIFEKELQPVDTLGLVQKKILEEGGGLALSKDCTVSVAYAGYWENEFEPYDFTKLDKPLVVNLNDNGLLPGVQIAIESMLVGEMSVFLLSYEVMYGDMGVPPKIKPKANCVFYLKLIKSIITPKDGKIDFSEPNIFERVLHEVKLLYSSGVVLHKSRNYMAAIQSFRKAVNMLHRCRLANESEEAIQEKFLKKLYINLAVCYNEVKQPLKTCIVCNELNRLRNLWNNEKVLYQNAKALRMIGQFDAAEKKLRRALRFSPDNDRILEELNLLQKTRDSCNQSRLIDSNVSSNSNNSANDQFRNEVDSLIKNFKENVNLCKLTLPPGLNAAEIGYIKEVCVKENLYCNKLQKDYLLDKEEDKVPLESKIDLFI-
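
Consistency check:

The transcript and protein lengths in this record agree: 424 codons plus one stop codon give 425 x 3 = 1275 bp. The following is a minimal Python sketch of how the nucleotide sequence could be translated and compared with the protein sequence. It assumes the standard genetic code and assumes that the trailing "-" in the protein sequence marks the stop codon. The names `translate` and `stop_symbol` are illustrative and are not part of the geneset.

```python
from itertools import product

# Standard genetic code, written in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are rendered as `stop_symbol`; "-" is an assumption chosen
    to match the way this record ends its protein sequence.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First eight and last four codons of DPOGS214284-TA, copied from the record.
first_codons = "ATGATCATTAATGTCTTCTCTAAA"
last_codons = "CTATTCATATAA"

print(translate(first_codons))
print(translate(last_codons))

# Length check: 424 residues plus one stop codon, three bases each.
print((424 + 1) * 3 == 1275)
```

Passing the full DPOGS214284-TA sequence to `translate` and comparing the result with DPOGS214284-PA extends the same check to the whole record.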