Gene: [00.0/BP1433TH] brain protein 14-3-3, theta subtype; stratifin; [c9112 HME1 ]

COM

[1] So far, two cDNAs of the gene have been cloned and published (Prasad GL &:1992; Leffers H &:1993). These sequences differ from each other at many positions including two nonsynonymous single nucleotide substitu- tions in the coding region, causing two amino acid substitutions. Since there are only two versions of the 14-3-3 theta cDNA, the discrepancies cannot be resolved. Additional clarifications from the authors are needed for the final verification of disputable sites (see 'Alignment').
[2] Most of the confusions related to definition of this isoform come from the SwissProt record P27348^143T_HUMAN that contains the following definition: '14-3-3 protein THETA (14-3-3 protein T-cell)(HS1 protein)'. This definition is however incorrect since it combines two different proteins, T-cell derived TAU and epithelial derived THETA/stratifin, into one entity (see Introduction in Jones DHA &:1995). In addition, there are several other confusing comments and references in the same entry such as 'the protein STRONGLY ACTIVATES protein kinase C' - that is inconsistent with the original definitions of the protein THETA, presented both in original publications and other related entries. Also, references to Fu H &:1993, gnb:L07955, and pir:S15076 are incor- rect since the sources are indeed relevant either to the BOVINE ZETA isoform (see GEM:02p25/YWHAZ) or HUMAN 14-3-3 TAU (GEM:00.0/BP1433TA).
[3] For general information about this protein family, see 'Summary' in FAM:BP1433/00.0"

SUM

The following information was extracted from the publication by Lef- fers H &: JMB, 1993 (Inst Medical Biochemistry, Danish Centre for Human Genome Research, Aarhus University). The authors identified a family of abundant acidic human keratinocyte proteins with MM 30,000-31,100 in the master two-dimensional gel database of human keratinocyte proteins (iso- electric focussing sample spot proteins: 9109 - epithelial marker strat- ifin, as well as 9124, 9125, 9126, and 9231). All of them shared amino acid sequences with each other, with protein 14-3-3, and with the kinase C inhibitory protein.Immunofluorescence of keratinocytes showed that two of the proteins (IEF SSPs 9124/9126) localize in the Golgi apparatus, whereas stratifin is distributed diffusely in the cytoplasm. Significant levels of stratifin, and in smaller amount the sample spot proteins 9124, 9125 and 9126, were detected in the cultured human keratinocyte medium suggesting that they are partially secreted by the cells. Two-dimensional gel analysis of proteins from cultured cells and fetal tissues showed that polypeptides comigrating with proteins 9124, 9125 and 9126 are ubiq- uitous and highly expressed in the brain. Stratifin, however, was present only in cultured epithelial cells and was most abundant in adult and fe- tal human tissues enriched in stratified squamous keratinising epithel- ium. The authors cloned and sequenced cDNAs encoding members of this fam- ily. The identity of the sequenced peptides from stratifin with the amino acid sequence derived from the stratifin cDNA clone indicated that this cDNA encodes the protein. The identity of clones 1054, HS1 and AS1 was less clear since (with few exceptions) none of their individual peptide sequences fits the predicted protein sequences. On the other hand, the polypeptides synthesized by clones 1054 and HS1 in the vaccinia expres- sion system comigrate with proteins 9126 and 9124, suggesting cell-type- specific expression of members of the protein family. Database searches indicated that clone HS1 corresponds to a human T-cell cDNA 14-3-3 clone, while clones 1054 and AS1 are strongly similar with the bovine 14-3-3 beta and eta sequences respectively. Microsequence data also indicated that IEF SSP 9124 is the human homologue of bovine 14-3-3 gamma. All se- quences cloned were submitted to GenBank and can be found by the follow- ing accession numbers: X57345, X57346, X57347, X57348."

COD

"<* sources="BR" ="<*" title="{PRT="><** dna="protein," prasad="'Complementary" gl="/ID:" cited="[GENBank] /ID: gnb:M93010^HUMMAMEPI| cDNA= 1245 bp;
'Human epithelial cell marker protein 1 (HMe1) mRNA';
//note: from the mammary gland\
/Date: Jan-07-1995\
/Duplicated_by: cbi:187301-302\
/Features: total= 1..1245;
5'-UTL= 1..10 //note: putative;
CDS= 11..757 //note: incl STOP_codon;
3'-UTL= 758..1245;
repeat_seg= 1118..1153 //note: repeat_family 'TG';
pA_signal= 1223..1228\
%-----------------------------------------------------------------------
<** hh="in" ="sen" h="/ID:" mol="/Note:" two="positions" cited="BR" electrophoresis="<**" leffers="Walbum" cloning="epithelial" refer="[Leffers" marker="has" been="J" e="'Molecular"><*** record="[PIR" pir="'14-3-3" cbi="/Note:" emb="'Homo" swp="'14-3-3" activator="in" leffers="Universitetsparken" from="/Date:" -="forming)\ %-----------------------------------------------------------------------
<* features="BR"><** note="BR"> /Note_1: The two cDNAs mentioned above, 'ref1' (Prasad GL &:1992) and
'ref2' (Leffers H &:1993), differ from each other at many positions
including two nonsynonymous single nucleotide substitutions in the
coding region, causing two amino acid substitutions. As only these
two versions of the 14-3-3 theta cDNA are known, the discrepancies
cannot be resolved. Additional clarifications from the authors are
needed for the final verification of disputable sites\
/Note_2: The LANDMARKS and SEQUENCE presented below are derived from
'ref2/emb:X57348' although the reason to choose this version is only
formal - it is a little longer than 'ref1/gnb:M93010'\
/Note_3: UTL - UnTransLated segment; CDS - CoDing Segment;
TLI - TransLation Initiation; TLT - TransLation Termination;
TTS - Transcription Termination Signal (polyA-signal);
TCT - TransCription Termination site (polyA-site);
DNR - DiNucletide Repeat; pAseq - polyA-sequence\
%-----------------------------------------------------------------------
<** nucleotides="in" ="H/His" case="Y/Tyr" and="V/Val," nucleotide="question;" numericals="aligned" alignment="[Correspondences" are="---------------------------------------------------------------------- //note_1: (atg........ CDS ........)tga;
| |\
//note_2: *120=Y:tac *242=A:gct\
{ref2:X57348 1 20 156=cCcCaga=162 166 523=t 890=c 910;
ref1:M93010 - 1 1=gCaCgag= 7 11 368=c 735=t 755}
//note_3: *120=H:cac *242=V:gtt\
%----------%
{ref2:X57348 942=cc 954=g 955=_ 976=_ 977=_ 979=g 980=t;
ref1:M93010 787=__ 797=c 798=a 820=t 822=t 825=a 826=_}
%----------%
{ref2:X57348 984=agg=986 990=c 1022=a 1027=g 1046=g 1051=g;
ref1:M93010 829=___=829 832=_ 863=c 868=_ 886=_ 890=_}
%----------%
{ref2:X57348 1067=g 1078=c 1215=t 1219=t 1240=t 1243=gc 1255=a;
ref1:M93010 905=_ 915=_ 1051=_ 1054=_ 1074=_ 1076=__ 1086=t}
%----------%
{ref2:X57348 1269=g 1287=_ 1317=______=1317 1318=g 1342=c;
ref1:M93010 1100=_ 1117=g 1148=tgtgtg=1153 1155=t 1179=t}
%----------%
{ref2:X57348 1379=c 1381=_ 1394=cccc=1397 1407 1450\
ref1:M93010 1216=t 1218=c 1232=gggg=1235 1245\}
----------------------------------------------------------------------
<** landmarks="BR"> {seg(1..165) = 5'-UTL(=165);
seg(166..909) = CDS(=744) //note: for 248 amino acids;
pos:166 = TLI_1 //note: translation START;
seg(910..1450) = 3'-UTL(=541);
pos:910 = TLT_1 //note: translation STOP;
seg(1286..1315) = DNR(gt*15) //note: x15 stretch;
seg(1385..1390) = TTS_1 //note: ;
pos:1407 = TCT_1 //note: polyA-site;
seg(1408..1450) = pAseq}
%-----------------------------------------------------------------------
<* ="%-------- ref2: 1=ccaggcagca gttagcccgc cgcccgcctg tgtgtcccca gagccatgga=
ref2: 51=gagagccagt ctgatccaga aggccaagct ggcagagcag gccgaacgct=
ref2: 101=atgaggacat ggcagccttc ccaggcagca gttagcccgc cgcccgcctg=
ref2: 151=tgtgtcccca gagcc=165_
%--------
<** ="%-------- pep: 1= M E R A S L I Q K A K L A E Q =
ref2: 166=atg gag aga gcc agt ctg atc cag aag gcc aag ctg gca gag cag=
pep: 16= A E R Y E D M A A F M K G A V =
ref2: 211=gcc gaa cgc tat gag gac atg gca gcc ttc atg aaa ggc gcc gtg=
pep: 31= E K G E E L S C E E R N L L S =
ref2: 256=gag aag ggc gag gag ctc tcc tgc gaa gag cga aac ctg ctc tca=
pep: 46= V A Y K N V V G G Q R A A W R =
ref2: 301=gta gcc tat aag aac gtg gtg ggc ggc cag agg gct gcc tgg agg=
pep: 61= V L S S I E Q K S N E E G S E =
ref2: 346=gtg ctg tcc agt att gag cag aaa agc aac gag gag ggc tcg gag=
pep: 76= E K G P E V R E Y R E K V E T =
ref2: 391=gag aag ggg ccc gag gtg cgt gag tac cgg gag aag gtg gag act=
pep: 91= E L Q G V C D T V L G L L D S =
ref2: 436=gag ctc cag ggc gtg tgc gac acc gtg ctg ggc ctg ctg gac agc=
pep:106= H L I K E A G D A E S R V F Y =
ref2: 481=cac ctc atc aag gag gcc ggg gac gcc gag agc cgg gtc ttc tac=
pep:121= L K M K G D Y Y R Y L A E V A =
ref2: 526=ctg aag atg aag ggt gac tac tac cgc tac ctg gcc gag gtg gcc=
pep:136= T G D D K K R I I D S A R S A =
ref2: 571=acc ggt gac gac aag aag cgc atc att gac tca gcc cgg tca gcc=
pep:151= Y Q E A M D I S K K E M P P T =
ref2: 616=tac cag gag gcc atg gac atc agc aag aag gag atg ccg ccc acc=
pep:166= N P I R L G L A L N F S V F H =
ref2: 661=aac ccc atc cgc ctg ggc ctg gcc ctg aac ttt tcc gtc ttc cac=
pep:181= Y E I A N S P E E A I S L A K =
ref2: 706=tac gag atc gcc aac agc ccc gag gag gcc atc tct ctg gcc aag=
pep:196= T T F D E A M A D L H T L S E =
ref2: 751=acc act ttc gac gag gcc atg gct gat ctg cac acc ctc agc gag=
pep:211= D S Y K D S T L I M Q L L R D =
ref2: 796=gac tcc tac aaa gac agc acc ctc atc atg cag ctg ctg cga gac=
pep:226= N L T L W T A D N A G E E G G =
ref2: 841=aac ctg aca ctg tgg acg gcc gac aac gcc ggg gaa gag ggg ggc=
pep:241= E A P Q E P Q S =248\
ref2: 886=gag gct ccc cag gag ccc cag agc=909_
%--------
<** ="%-------- ref2: 910=TGAgtgttgc ccgccaccgc cccgccctgc cccctccagt ccccgccctg=
ref2: 960=ccgagaggac tagtatgggg tgggaggccc cacccttctc ccctaggcgc=
ref2:1010=tgttcttgct ccaaagggct ccgtggagag ggactggcag agctgaggcc=
ref2:1060=acctggggct ggggatccca ctcttcttgc agctgttgag cgcacctaac=
ref2:1110=cactggtcat gcccccaccc ctgctctccg cacccgcttc ctcccgaccc=
ref2:1160=caggaccagg ctacttctcc cctcctcttg cctccctcct gcccctgctg=
ref2:1210=cctcttgatt cgtaggaatt gaggagtgtc tccgccttgt ggctgagaac=
ref2:1260=tggacagtgg caggggctgg agatggGTGT GTGTGTGTGT GTGTGTGTGT=
ref2:1310=GTGTGTgcgc gcgcgccagt gcaagaccga gactgaggga aagcatgtct=
ref2:1360=gctgggtgtg accatgtttc ctctcAATAAA_gttcccctgtgacactC_aa=
ref2:1410=aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa a=1450\"

ECR

"<* cross-references="[PROT_DB] /ID: swp:P31947^143S_HUMAN| prt= 248 aa /Date: Jul-01-1993^Jun-01-1994;
/Duplicated_by: cbi:398953\
/ID: pir:S34753^S29340....| prt= 248 aa /Date: Dec-31-1993^Apr-12-1995\
/ID: prosite:PS00796-797\
/ID: aarhus/ghent-2Dpage 9109\
<* cross-references="[DNA_DB] /ID: emb:X57348^HS9112...| cDNA= 1450 bp /Date: Jul-19-1993;
/Duplicated_by: cbi:23939-940\
/ID: gnb:M93010^HUMMAMEPI| cDNA= 1245 bp /Date: Jan-07-1995;
/Duplicated_by: cbi:187301-302\"

REF

CLO,SEQ,COD,PEP,EXP "Leffers H &: J Mol Biol, 231, N4 (Jun 20), 982-998, 1993
CLO,SEQ,COD,PEP,EXP "Prasad GL &: Cell Growth Differ, 3, N8, 507-513, 1992
PEP,SEQ "Rasmussen HH &: Electrophoresis, 13, 960-969, 1992

SWI

SWISSPROT: P31947

KEY

aac, sign

CLA

coding, basic

LOC

RS

MIM

MIM: 601290

SYN

c9112 HME1

Ссылки: