How do people listen in a conversation, especially in an intercultural setting, and how do they shift from listener to speaker in the particular context? This book investigates listenership behaviours of a tutor and a student in the context of academic supervision sessions at a university in the UK, comparing British tutor - British student conversations with British tutor - Japanese student conversations in English. A new research methodology, a time-aligned multimodal corpus analysis, is introduced for analysing listenership and turn-taking structure, synthesising visual data with verbal data in timeline. The method also integrates discourse-pragmatic and conversation analytic approaches with the corpus-based analysis. This work reports strategies in use of response tokens for framework shifts and multi-functional nature of hand gestures observed in the conversations. Therefore, this book is highly relevant for researchers and postgraduate students, who study pragmatic and discursive practice in intercultural settings using multimodal corpora.