Newer
Older
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
This repository is the template for
[Software Carpentry](http://software-carpentry.org) lessons. To
create a new lesson:
1. Create a new empty repository on GitHub. (You must create a
new repository, rather than forking this repository, because
GitHub only allows a user to have one fork of a particular
repository, and you may wish to create several lessons.)
2. Clone that empty repository to your desktop.
3. Create a branch in that repository called `gh-pages`.
4. Add this repository as a remote called `template`.
5. `git pull template gh-pages` to copy the content of this
repository into your repository.
6. Create and edit files as explained below.
7. `git push origin gh-pages` to send your changes to GitHub for
viewing.
## Terms
* A *lesson* is a complete story about some subject, typically taught
in 2-4 hours.
* A *topic* is a single scene in that story, typically 5-15 minutes
long.
* A *slug* is a short identifier for something, such as `filesys` (for
"file system").
## Why Use a Template?
We have chosen to organize our lessons in a standard way so that:
1. They will have the same look and feel, and can be navigated in
predictable ways, even when they are written by different (and
multiple) people.
2. Contributors know where to put things when they are extending or
modifying lessons.
3. Content can more easily be checked. For example, we want to make
sure that every learning objective is matched by a challenge,
and that every challenge corresponds to one or more learning
objectives.
In the longer term, a standard format will also help us build tools to
remix lessons, but the formatting rules must always be justifiable in
terms of short-term gains for instructors and learners.
Instead of putting the whole lesson in one page, we expect authors to
use one short page per topic. This division shows each learning
sprint explicitly, and the small chunks will make it easier for us to
keep track of how long each piece takes. The cycle we expect in each
topic within a lesson is:
1. Explain the topic's objectives.
2. Teach it.
3. Do one or more challenges (depending on time).
We also require the following for each lesson:
* *Introductory slides* to give learners a sense of where the next
two or three hours are going to take them.
* A *reference guide* learners can use during the lesson, and look
back at afterward. This should includes a glossary of terms to
help lesson authors think through what they expect learners to be
unfamiliar with, and to make searching through lessons easier.
* An *instructor's guide* containing our collected wisdom about this
lesson and solutions to the challenge exercises. We ask everyone
who teaches for us to review and update the instructor's guide for
each lesson they taught after each workshop.
Note that the this means the solutions to the lesson's challenge
exercises will be up on the web. We have chosen to do this
because we believe in openness, and because there's no point
trying to hide something that's in a publicly-readable repository.
## Background
There are a few things you need to know in order to understand why we
do things the way we do.
1. Git uses the term *clone* to mean "a copy of a repository".
GitHub uses the term *fork* to mean, "a copy of a GitHub-hosted
repo that is also hosted on GitHub", and the term *clone* to mean
"a copy of a GitHub-hosted repo that's located on someone else's
machine". In both cases, the duplicate has a remote called
`origin` that points to the original repo; other remotes can be
added manually.
2. A user on GitHub can only have one fork of a particular repo.
This is a problem for us because an author may be involved in
writing several lessons, each of which has its own website repo.
Those website repositories ought to be forks of this one, but
since GitHub doesn't allow that, we've had to find a workaround.
3. If a repository has a branch called `gh-pages` (which stands for
"GitHub pages"), then GitHub uses the HTML and Markdown files in
that branch to create a website for the repository. If the
repository's URL is `http://github.com/darwin/finches`, the URL
for the website is `http://darwin.github.io/finches`.
4. We have chosen to use Markdown for writing pages because it's
simple to learn, and isn't tied to any specific language (the
ReStructured Text format popular in the Python world, for example,
is a complete unknown to R programmers). If authors want to use
something else for their lessons (e.g., IPython Notebooks), it's
up to them to generate and commit Markdown formatted according to
the rules below.
5. We have chosen to use Pandoc to process pages instead of Jekyll
(GitHub's default conversion tool) because Pandoc supports a much
richer dialect of Markdown than Jekyll. Like Jekyll, Pandoc looks
for a header at the top of each page formatted like this:
~~~
---
key: value
other_key: other_value
---
...stuff in the page...
~~~
and uses that data when formatting the page.
6. Using Pandoc instead of Jekyll means that we have to compile our
Markdown into HTML on our own machines and commit it to the
`gh-pages` branch of the lesson's GitHub repository. In order to
keep our source files and generated files separate, we put our
source files in a sub-directory called `pages`, and compile them
"upward" into the root directory of the lesson's repository.
7. In order to display properly, our generated HTML pages need
artwork, CSS style files, and a few bits of Javascript. We could
always load these from the web, but that would make offline
authoring difficult. Instead, each lesson's repository has a copy
of these files, and a way of updating them (and only them) on
demand.
One final note: we try not to put HTML inside Markdown because it's
ugly to read and write, and error-prone to process. Instead, we put
things that ought to be in `<div>` blocks, like the learning
objectives and challenge exercises, in blocks indented with `>`, and
do a bit of post-processing to attach the right CSS classes to these
blocks.
## Overall Layout
Each lesson is stored in a directory laid out as described below. That
directory is a self-contained Git repository (i.e., there are no
submodules or clever tricks with symbolic links).
1. `README.md`: initially a copy of this file. It should be
overwritten with short description of the lesson.
2. `pages/`: a sub-directory containing the source of the lesson's
website. See "Pages" below.
3. `code/`: a sub-directory containing all sample code. See "Code,
Data, and Figures" below.
4. `data/`: a sub-directory containing all data files for this lesson.
See "Code, Data, and Figures" below.
5. `fig/`: figures, plots, and diagrams used in the lesson. See
"Code, Data, and Figures" below.
6. `_layouts/`: page layout templates. See "Support Files" below.
7. `_includes/`: page inclusions. See "Support Files" below.
8. `css/`: style sheets used in the lesson's web site. See "Support
Files" below.
9. `img/`: artwork used in the lesson's web site. See "Support
Files" below.
10. `js/`: Javascript used in the lesson's website. See "Support
Files" below.
11. `tools/`: tools for managing lessons. See "Tools" below.
## Code, Data, and Figures
All of the software samples used in the lesson must go in a directory
called `code/`. Stand-alone data files must go in a directory called
`data/`. Groups of related data files must be put together in a
sub-directory of `data/` with a meaningful (short) name.
Figures, plots, and diagrams used in the lessons must go in a `fig/`
directory. We strongly prefer SVG for line drawings, since they are
smaller, scale better, and are easier to edit. Screenshots and other
raster images must be PNG or JPEG format.
**Notes:**
1. This mirrors the layout a scientist would use for actual work.
2. However, it may cause novice learners problems. If `code/program.py`
includes a hard-wired path to a data file, that path must be either
`datafile.ext` or `data/datafile.ext`. The first will only work if
the program is run with the lesson's root directory as the current
working directory, while the second will only work if the program is
run from within the `code/` directory. This is a learning
opportunity for students working from the command line, but a
confusing annoyance inside IDEs and the IPython Notebook (where the
tool's current working directory is less obvious). And yes, the
right answer is to pass filenames on the command line, but that
requires learners to understand how to get command line arguments,
which isn't something they'll be ready for in the first hour or two.
## Support Files
Files used to display the lesson, such as artwork, CSS, and
Javascript, are stored in directories of their own. We keep website
artwork separate from graphics used in the lesson's to make it simple
to update the former automatically. Most authors should not need to
modify any of the support files themselves.
The `_layouts/` directory holds the page templates used to translate
Markdown to HTML, while the `_includes/` directory holds snippets of
HTML that are used in several page layouts. These directories have
underscores at the start of their names to be consistent with Jekyll's
naming conventions, but the files they contain are for Pandoc.
## Tools
The `tools/` directory contains tools to help create and maintain
lessons:
* `tools/check`: make sure that everything is formatted properly, and
print error messages identifying problems if it's not.
## Pages
The `pages/` directory holds the content of the lesson.
1. `index.md`: the home page for the lesson. (See "Home Page" below.)
2. `dd-slug.md`: the topics in the lesson. `dd` is a sequence number
such as `01`, `02`, etc., and `slug` is an abbreviated single-word
mnemonic for the topic. Thus, `03-filesys.md` is the third topic in
this lesson, and is about the filesystem. (Note that we use hyphens
rather than underscores in filenames.) See "Topics" below.
3. `motivation.md`: slides for a short introductory presentation (three
minutes or less) explaining what the lesson is about and why people
would want to learn it. See "Introductory Slides" below.
4. `reference.md`: a cheat sheet summarizing key terms and commands,
syntax, etc., that can be printed and given to learners. See
"Reference Guide" below.
5. `instructors.md`: the instructor's guide for the lesson. See
"Instructor's Guide" below.
### Home Page
`index.md` must be structured as follows:
---
layout: lesson
title: Lesson Title
keywords: ["some", "key terms", "in a list"]
---
Paragraph of introductory material.
> ## Prerequisites
>
> A short paragraph describing what learners need to know
> before tackling this lesson.
## Topics
* [Topic Title 1](01-slug.html)
* [Topic Title 2](02-slug.html)
## Other Resources
* [Introduction](intro.html)
* [Reference Guide](reference.html)
* [Instructor's Guide](guide.html)
**Notes:**
1. The description of prerequisites is prose for human consumption, not
a machine-comprehensible list of dependencies. We may supplement
the former with the latter once we have more experience with this
lesson format and know what we actually want to do. The block must
be titled "Prerequisites" so we can detect it and style it
properly.
2. Software installation and configuration instructions *aren't* in the
lesson, since they may be shared with other lessons. They will be
stored centrally on the Software Carpentry web site and linked
from the lessons that need them.
### Topics
Each topic page must be structured as follows:
---
layout: topic
title: Topic Title
minutes: MM
---
> ## Learning Objectives {.objectives}
>
> * Learning objective 1
> * Learning objective 2
Paragraphs of text mixed with:
~~~ {.python}
some code:
to be displayed
~~~
~~~ {.output}
output
from
program
~~~
~~~ {.error}
error reports from program (if any)
~~~
and possibly including:
> ## Callout Box {.callout}
>
> An aside of some kind.
> ## Challenge Title {.challenge}
>
> Description of a single challenge.
> There may be several challenges.
**Notes:**
1. The "expected time" heading is called minutes to encourage people to
create topics that are short (10-15 minutes at most).
2. There are no sub-headings inside a topic other than the ones shown:
if a topic needs sub-headings, it should be broken into two or more
topics.
3. Every challenge should relate directly back to a learning objective.
### Motivational Slides
Every lesson must include a short slide deck suitable for a short
presentation (3 minutes or less) that the instructor can use to explain
to learners how knowing the subject will help them.
**Notes:**
1. *Flesh this out and provide an example.*
### Reference Guide
The reference guide is a cheat sheet for learners to print, doodle on,
and take away. Its format is deliberately unconstrained for now,
since we'll need to see a few before we can decide how they ought to
be laid out (or whether they need to be laid out the same way at all).
The last section of the reference guide must be a glossary laid out as
a definition list:
---
layout: reference
---
...commands and examples...
## Glossary
Key Word 1
: Definition of first term
Key Word 2
: Definition of second term
### Instructor's Guide
Many learners will go through the lessons outside of class, so it
seems best to keep material for instructors in a separate document,
rather than interleaved in the lesson itself. Its structure is:
---
title: Instructor's Guide
---
## Overall
One or more paragraphs laying out the lesson's legend.
## General Points
* Point
* Point
## [Topic Title 1](01-slug.html)
* Point
* Point
1. Discussion of first challenge.
2. Discussion of second challenge.
## [Topic Title 2](02-slug.html)
* Point
* Point
1. Discussion of first challenge.
2. Discussion of second challenge.
**Notes:**
1. The topic headings must match the topic titles. (Yes, we could
define these as variables in a configuration file and refer to those
variables everywhere, but in this case, repetition will be a lot
easier to read, and our validator can check that the titles line
up.)
2. The points can be anything: specific ways to introduce ideas, common
mistakes learners make and how to get out of them, or anything else.
3. Full solutions to the challenges do not have to be presented, but
every challenge should be discussed, and that discussion should
mention how long it typically takes to do. (Those estimates do
not go in the challenge itself, since they can increase learners'
stress levels.)