Corpus: SIARAD

The Siarad corpus consists of 69 recordings and transcripts of conversations from 151 speakers, totalling 40 hours and containing 460,000 word tokens. The conversations were collected between 2005 and 2007 and transcribed between 2006 and 2008. A detailed documentation file is available in PDF format, and a spreadsheet contains the output from the questionnaires.

Information about the participants in the conversations is given below. Click on the filename to examine the conversation in more detail.

File       Speaker  Age      Sex
davies1    NON      18       female
davies1    SAR      19       female
davies2    GRE      23       female
davies2    GWY      23       female
davies3    HAR      13       male
davies3    TOS      15       male
davies4    CYN      57       male
davies4    OSW      57       male
davies5    COL      17       male
davies5    MER      18       male
davies5    SIO      18       male
davies6    DAN      25       male
davies6    HEC      23       male
davies7    GAI      16       female
davies7    TRA      14       female
davies9    LLE      22       male
davies9    MOS      19       male
davies10   CLE      58       male
davies10   HIL      63       female
davies10   MIC      52       male
davies11   DER      72       female
davies11   OWA      67       male
davies11   RAC      52       female
davies12   CER      20       female
davies12   SAL      19       female
davies13   JAM      19       male
davies13   MEI      20       male
davies14   FRE      67       male
davies14   GWA      53       female
davies15   NEL      23       female
davies15   TEG      26       female
davies16   ADA      16       male
davies16   HYW      16       male
davies17   GLA      35       female
davies17   ROB      31       male
deuchar1   MYF      65       female
deuchar1   SER      64       female
fusser3    ALY      32       female
fusser3    BEC      32       female
fusser4    ADW      74       female
fusser4    BAE      54       male
fusser5    DYF      29       male
fusser5    ENA      42       female
fusser5    GWE      36       female
fusser6    AMR      36       female
fusser6    ANT      52       female
fusser7    ANE      39       female
fusser7    BLO      36       female
fusser7    CIG      35       female
fusser8    ANG      70       female
fusser8    BRE      60       female
fusser8    MEN      59       female
fusser9    ABE      58       male
fusser9    BAG      57       male
fusser10   ADD      53       male
fusser10   BAR      57       male
fusser11   AED      52       male
fusser11   BED      77       male
fusser12   CEW      58       female
fusser12   LNW      18       female
fusser12   WEN      46       female
fusser13   ANN      61       female
fusser13   BEI      65       male
fusser13   CRI      60       female
fusser14   AWE      47       female
fusser14   BEL      43       male
fusser14   CYW      2        male
fusser15   GFR      50       male
fusser15   MRL      40       female
fusser16   ANW      69       female
fusser16   SIR      68       female
fusser17   AET      65       male
fusser17   BEN      47       male
fusser17   RES      unknown  na
fusser18   ARD      41       female
fusser18   BEU      41       male
fusser18   CAI      unknown  male
fusser18   HAF      unknown  female
fusser19   OLW      28       female
fusser19   TRE      37       male
fusser21   HAW      17       female
fusser21   ILI      16       female
fusser22   EVA      40       female
fusser22   WYN      49       male
fusser23   AID      71       male
fusser23   HEL      81       female
fusser25   ALB      25       male
fusser25   HUN      25       female
fusser26   IOL      69       female
fusser26   TEC      71       male
fusser27   LIS      20       female
fusser27   MAB      19       female
fusser28   IFO      30       male
fusser28   LLA      21       male
fusser28   MAD      unknown  female
fusser29   LOI      25       female
fusser29   MAG      27       female
fusser30   LON      25       female
fusser30   MEL      28       female
fusser31   ARF      43       male
fusser31   BRW      12       male
fusser32   LOR      25       female
fusser32   MAT      64       male
fusser32   STE      34       male
lloyd1     ART      26       male
lloyd1     GRG      53       male
lloyd1     JEA      53       female
lloyd1     SAN      22       female
robert1    FLO      25       female
robert1    REG      29       male
robert2    GLE      19       female
robert2    RIS      19       male
robert3    BTI      15       female
robert3    LUN      16       female
robert4    KAT      24       female
robert4    KIM      25       female
robert5    ELI      89       female
robert5    LIN      59       female
robert6    EIR      56       female
robert6    MOR      27       female
robert7    CLR      57       female
robert7    HUW      66       male
robert7    TWM      34       male
robert8    CLV      77       male
robert8    EML      79       male
robert8    GOR      81       male
robert8    INT      82       male
robert8    STN      86       male
robert9    CRL      23       female
robert9    PEN      35       male
roberts1   HEF      25       male
roberts1   HOW      33       male
roberts2   ION      45       female
roberts2   IRW      45       female
roberts3   LER      41       female
roberts3   MED      56       female
roberts4   LIL      21       female
roberts4   MEC      19       male
smith1     CEI      17       male
smith1     DEW      45       male
stammers1  EIF      61       male
stammers1  GTH      72       male
stammers2  CHR      10       female
stammers2  JAQ      38       female
stammers3  GUT      37       male
stammers3  NER      33       female
stammers4  ALN      42       male
stammers4  ELE      40       female
stammers5  RHO      39       male
stammers5  SND      36       female
stammers6  BLW      18       female
stammers6  HEU      49       female
stammers6  IFA      48       male
stammers7  GWN      25       male
stammers7  ROY      31       male
stammers8  CAR      67       female
stammers8  ISL      66       male
stammers9  ENF      67       female
stammers9  RNW      70       female


Contact us

bilingualism@bangor.ac.uk


Acknowledgements

The support of the Arts and Humanities Research Council (AHRC), the Economic and Social Research Council (ESRC), the Higher Education Funding Council for Wales (HEFCW) and the Welsh Government is gratefully acknowledged.