logo
Contact Us
About MICASE
History
Speech Event & Speaker Attributes
Statistical Overview
Transcription & Spelling Conventions
SoundScriber
>FAQ
MICASE Manual

Statistical Overview of Speakers and Speech Events

The four tables provided here give descriptive frequency figures for MICASE, showing the composition of the corpus by speaker and speech event categories. These word counts are for the untagged version of MICASE, and ‘words' were defined as strings bounded by spaces or punctuation; hyphenated words (e.g., eighty-five, cross-sectional, non-native) and words with apostrophes (e.g., she'll, shouldn't've, don't, John's) counted as one word.

Table 1          Speaker and word counts by speaker categories

 Speaker Category

Total
Speakers

Total Words

% of
Total Corpus

Gender

Male

729

786,487

46%

Female

842

909,053

54%

Academic Role

Faculty
            Male
            Female

160

825,829

49%*

84

446,925

26%

76

378,904

22%

Students

1,039

742,348

44%*

 

Undergraduates
            Male
            Female

782

368,433

22%

336

142,102

8%

446

226,331

13%

Graduates
            Male
            Female

257

373,915

22%

121

158,696

9%

136

215,219

13%

Language Status

Native Speakers

1,449

1,493,586

88%

Non-native speakers

122

201,954

12%

Totals

 

1,571

1,695,540

 

 

Table 2           Speaker and word counts by academic division

Academic
 Division

Speech Events

Speakers

Words

% of Total Corpus

% Male

%
Female

% Faculty*

%  Students*

Humanities & Arts

37

368

450,348

27

56

44

63

29

Social Sciences & Education

34

433

404,668

24

37

63

44

55

Biological & Health Sciences

32

257

325,456

19

41

59

55

42

Physical Sciences & Engineering

36

314

358,776

21

55

45

44

52

Other/NA

13

199

156,292

9

37

63

20

41

 

Table 3       Speaker and word counts by interactivity rating (previously primary discourse mode)

Primary
Discourse Mode

Speech Events

Speakers

Words

%
of Total Corpus

% Male

%
Female

%  Faculty*

%
Students*

Highly Monologic

14

41

136,239

8

50

50

84

14

Mostly Monologic

 

37

 

266

 

356,525

21

27

73

76

16

Mixed

26

258

256,384

15

46

54

26

63

Mostly Interactive

 

27

 

334

 

381,709

23

51

49

54

39

Highly Interactive

 

48

 

520

564,683

33

48

52

28

72

 

Table 4           Speaker and word counts by speech event type

Speech Event Type

Transcripts

Speakers

Words

% of Total Corpus

% Male

%
Female

% Faculty &/or Staff

%
Students

Advising

2

11

35,275

2.1

13

87

70

30

Colloquia

14

121

157,333

9.3

51

49

89

11

Discussion Sections

9

112

74,904

4.4

37

63

33

67

Diss.
Defenses

4

26

56,837

3.4

55

45

37

63

Interviews

3

6

13,015

0.8

83

17

56

44

Labs

8

42

73,815

4.4

70

30

32

68

Large
Lectures

30

214

251,632

14.8

54

46

94

6

Small
Lectures

32

296

333,338

19.7

46

54

78

22

Meetings

6

60

70,038

4.1

66

34

38

62

Office Hours

14

106

171,188

8.2

41

59

29

71

Seminars

7

72

138,626

7.7

60

40

65

35

Study Groups

8

36

129,725

7.7

32

68

0

100

Student Presentations

11

146

143,369

8.5

24

76

22

78

Service Encounters

2

90

24,691

1.5

41

59

40

60

Tours

2

19

21,768

1.3

58

42

39

61

 

* Note: In these tables, percentages for faculty and students do not add up to 100% because of other speaker roles (e.g. staff, researchers, visitors) not included in these counts.