Skip to content

Matthew (Mat) — Language Profile

New Testament | Canonical order: 40 | Chapters: 28

Build script: scripts/nt/survey/build_book_profiles.py


Summary Statistics

Metric Value
Total word tokens 18,882
Unique lexical lemmas 1,751
Type-token ratio (TTR) 0.093
Hapax legomena (once-only) 694 (39.6% of vocab)

Part-of-Speech Distribution

Δ = difference from corpus average for this testament.

Part of Speech This Book Corpus Avg Δ
Verb 21.5% 20.1% +1.4%
Noun 19.6% 20.7% -1.1%
Article 15.4% 14.6% +0.8%
Conjunction 12.6% 13.0% -0.4%
Pronoun 9.1% 8.1% +1.0%
Preposition 6.9% 8.1% -1.2%
Adjective 5.8% 6.0% -0.2%
Particle 2.6% 2.8% -0.2%
Adverb 2.0% 1.7% +0.3%
D 1.1% 1.2% -0.1%

Verb Analysis

Total verbs: 4,055 (21.5% of words)

Tense Distribution

Tense This Book Corpus Avg Δ
Present 35.7% 41.3% -5.6%
Aorist 27.7% 23.4% +4.3%
2nd Aorist 21.2% 17.8% +3.4%
Future 8.7% 5.6% +3.1%
Imperfect 3.6% 6.0% -2.4%
R 2.5% 4.9% -2.4%
2R 0.3% 0.6% -0.3%
2nd Future 0.1% 0.1% 0.0%

Voice Distribution

Voice This Book Corpus Avg Δ
Active 75.1% 73.6% +1.5%
Passive 11.1% 11.1% 0.0%
Deponent 5.1% 5.4% -0.3%
N 4.7% 5.9% -1.2%
Middle Deponent 2.5% 1.2% +1.3%
Middle 1.4% 2.6% -1.2%

Top 20 Most Frequent Lexical Lemmas

Rank Strong's Occurrences % of Book
1 G3588 2,902 15.37%
2 G2532 1,238 6.56%
3 G0846 980 5.19%
4 G1161 513 2.72%
5 G4771 482 2.55%
6 G1722 302 1.60%
7 G1510 290 1.54%
8 G3004G 282 1.49%
9 G1519 223 1.18%
10 G3165 210 1.11%
11 G3756 206 1.09%
12 G2036 182 0.96%
13 G2424G 175 0.93%
14 G3778 152 0.80%
15 G3361 138 0.73%
16 G3739 131 0.69%
17 G3956 131 0.69%
18 G1063 129 0.68%
19 G1909 126 0.67%
20 G0575 119 0.63%

Generated by berean-bible-bots. Source: STEPBible TAHOT/TAGNT (CC BY).