程式語言的語法 Grammar

作者 : 陳鍾誠單位 : 金門技術學院資管系Email: ccc@kmit.edu.twURL : http://ccc.kmit.edu.tw

日期 : 112/04/20

程式語言的語法Grammar

Grammar

2 陳鍾誠 - 112/04/20

Language

3 陳鍾誠 - 112/04/20

Recursive Definition

4 陳鍾誠 - 112/04/20

Mathematical Expression

5 陳鍾誠 - 112/04/20

Structure of Expressions

6 陳鍾誠 - 112/04/20

Formal Language

7 陳鍾誠 - 112/04/20

Backus Naur Form (BNF)

8 陳鍾誠 - 112/04/201960 by J. Backus and P. Naur

EBNF (Extended BNF)

9 陳鍾誠 - 112/04/20

BNF EBNF

10 陳鍾誠 - 112/04/20

BNF EBNF

Formalism (Formal notation)

N. Chomsky

近代結構語言學之父

11 陳鍾誠 - 112/04/20N. Chromsky -

Differing structural trees for the same expression

12 陳鍾誠 - 112/04/20

Problem of Different structural trees

13 陳鍾誠 - 112/04/20

No Ambiguous Sentence

14 陳鍾誠 - 112/04/20

Context Free Language Syntactic equations of the form defined in EBNF generate context-

free languages. The term "context free” is due to Chomsky and stems from the fact

that substitution of the symbol left of = by a sequence derived from the expression to the right of = is always permitted, regardless of the context in which the symbol is embedded within the sentence.

It has turned out that this restriction to context freedom (in the sense of Chomsky) is quite acceptable for programming languages, and that it is even desirable.

Context dependence in another sense, however, is indispensible. We will return to this topic in Chapter 8.

15 陳鍾誠 - 112/04/20

Regular Expression

A language is regular, if its syntax can be expressed by a single EBNF expression.

The requirement that a single equation suffices also implies that only terminal symbols occur in the expression.

Such an expression is called a regular expression.

16 陳鍾誠 - 112/04/20

Syntax Analysis v.s. Regular Expression

The reason for our interest in regular languages lies in the fact that programs for the recognition of regular sentences are particularly simple and efficient. By "recognition" we mean the determination of the structure of the sentence, and thereby naturally the determination of whether the sentence is well formed, that is, it belongs to the language. Sentence recognition is called syntax analysis.

17 陳鍾誠 - 112/04/20

Regular Expression v.s. State Machine

For the recognition of regular sentences a finite automaton, also called a state machine, is necessary and sufficient. In each step the state machine reads the next symbol and changes state. The resulting state is solely determined by the previous state and the symbol read. If the resulting state is unique, the state machine is deterministic, otherwise nondeterministic. If the state machine is formulated as a program, the state is represented by the current point of program execution.

18 陳鍾誠 - 112/04/20

EBNF Program The analyzing program can be derived directly from the

defining syntax in EBNF. For each EBNF construct K there exists a translation rule which yields a program fragment Pr(K). The translation rules from EBNF to program text are shown below. Therein sym denotes a global variable always representing the symbol last read from the source text by a call to procedure next. Procedure error terminates program execution, signaling that the symbol sequence read so far does not belong to the language.

19 陳鍾誠 - 112/04/20

Analyzing program

20 陳鍾誠 - 112/04/20

EBNF with only 1 rule

21 陳鍾誠 - 112/04/20

First()

22 陳鍾誠 - 112/04/20

Precondition

23 陳鍾誠 - 112/04/20

Lexical Analysis for Identifier

24 陳鍾誠 - 112/04/20

Lexical Analysis for Integer

25 陳鍾誠 - 112/04/20

Scanner

The process of syntax analysis is based on a procedure to obtain the next symbol. This procedure in turn is based on the definition of symbols in terms of sequences of one or more characters. This latter procedure is called a scanner, and syntax analysis on this second, lower level, lexical analysis.

26 陳鍾誠 - 112/04/20

Lexical Analysis v.s. Syntax Analysis

27 陳鍾誠 - 112/04/20

A Scanner Example

As an example we show a scanner for a parser of EBNF. Its terminal symbols and their definition in terms of characters are

28 陳鍾誠 - 112/04/20

Procedure GetSym() –(1)

29 陳鍾誠 - 112/04/20

30 陳鍾誠 - 112/04/20

31 陳鍾誠 - 112/04/20

Syntax Analysis Overview Goal – determine if the input token stream

satisfies the syntax of the program What do we need to do this?

An expressive way to describe the syntax A mechanism that determines if the input token

stream satisfies the syntax description For lexical analysis

Regular expressions describe tokens Finite automata = mechanisms to generate tokens

from input stream

Just Use Regular Expressions?

REs can expressively describe tokens Easy to implement via DFAs

So just use them to describe the syntax of a programming language NO! – They don’t have enough power to express any non-

trivial syntax Example – Nested constructs (blocks, expressions,

statements) – Detect balanced braces:{{} {} {{} { }}}{ { { { {

}}}} }

. . .- We need unbounded counting!- FSAs cannot count except in a strictly modulo fashion

Context-Free Grammars Consist of 4 components:

Terminal symbols = token or Non-terminal symbols = syntactic variables Start symbol S = special non-terminal Productions of the form LHSRHS

LHS = single non-terminal RHS = string of terminals and non-terminals Specify how non-terminals may be expanded

Language generated by a grammar is the set of strings of terminals derived from the start symbol by repeatedly applying the productions L(G) = language generated by grammar G

S a S aS TT b T b

CFG - Example Grammar for balanced-parentheses

language S ( S ) S S

1 non-terminal: S 2 terminals: “)”, “)” Start symbol: S 2 productions

If grammar accepts a string, there is a derivation of that string using the productions “(())” S = (S) = ((S) S) = (() ) = (())

? Why is the final S required?

程式語言的語法 Grammar

Documents

C言語入門 - Yamaguchi Uweb.cc.yamaguchi-u.ac.jp/~okadalab/CLangI2014/CLangI2014_12p.p… · c言語入門第12週プログラミング言語Ⅰ(実習を含む。), 計算機言語Ⅰ・計算機言語演習Ⅰ,

プログラミング言語 10 言語処理系tau/lecture/... · 2019-07-17 · 高水準言語vs. 機械語高水準言語(e.g., C) 機械語制御構造 for, while, if, ... ˇ

フランス語母語話者の日本語：中間言語分析 · フランス語母語話者の日本語：中間言語分析フランス語、ポルトガル語、日本語、トルコ語の対照中間言語分析

平成27年度言語研修アラビア語パレスチナ方言研 …平成27年度言語研修東京外国語大学アジア・アフリカ言語文化研究所 2015 アラビア語パレスチナ方言研修テキスト2

①言語文化学部・言語情報コース - tufs.ac.jp™¢生卒論スケジューリング事例.pdf · ①言語文化学部・言語情報コース計画内容学事暦

プログラミング - OSLoptsci.eng.shizuoka.ac.jp/class/programing1/1st_lecture.pdf · 参考書：「明快入門C ... •低級言語（マシン語，アセンブリ言語）⇔高級言語（C言語，Java

Oracle Text 詳細解説...147 言語グループ含まれる言語

「プログラミング言語」 SICP 第4章～超言語的抽象～その4「プログラミング言語」 SICP 第4章～超言語的抽象～その4 五十嵐淳 igarashi@kuis.kyoto-u.ac.jp

多言語音声翻訳システム（アプリ「VoiceTra」）と …多言語音声翻訳システム（アプリ「VoiceTra」）とは・31言語間の翻訳・うち23言語は音声入力、

言語の不可算性から見る言語学と言語政策 - Tokyo …repository.tufs.ac.jp/bitstream/10108/93022/1/dt-ko-0268.pdfCsb. ＝カシューブ語、Eng. ＝英語、Fre

邏輯語的文法 -- Lojban grammar

3_C言語入門 - C言語の基本

オブジェクト指向言語オブジェクト指向言語演習

C言語入門web.cc.yamaguchi-u.ac.jp/~okadalab/CLangI2014/CLangI2014...C言語入門第11週プログラミング言語Ⅰ(実習を含む。), 計算機言語Ⅰ・計算機言語演習Ⅰ,

言語・オートマトン - iip.ist.i.kyoto-u.ac.jp · 形式言語理論は，プログラミング言語を設計する基盤になっているほか，マークアップ言語や自然言語処理，

言語解析論 - 国立大学法人岡山大学 · 言語コンピュータでの言語処理 • 2つの言語処理 – 人間の言葉à自然言語 – コンピュータà形式言語

私的言語ロック言語論と - Musashino Uロック言語論と「プライベート性」の問題一ノ瀬正樹ここで私は、1 私的言語ジョン・ロックの言語哲学に即しな

認識程式語言 - fisp.com.twcd.fisp.com.tw/05857BB/pdf_b/05855BB_ch01.pdf · 第1 章認識程式語言 3 低階語言低階語言是在電腦發展初期就開發出來的程式語言，這種語言具有機器依存

2 電子書-程式語言-10小時學c++ 語言

「多言語防災サイト」防災・災害時の多言語 ......「多言語防災サイト」Well翻訳タグAPI 「多言語防災サイト」は、閲覧者の端末の使用言語に合せて、