comparison en/tour-basic.tex @ 95:47ea206351d5

Split tour into two sections.
author Bryan O'Sullivan <bos@serpentine.com>
date Fri, 13 Oct 2006 14:00:06 -0700
parents en/tour.tex@0b97b0bdc830
children 7d7ddc3a57af
comparison
equal deleted inserted replaced
94:0b97b0bdc830 95:47ea206351d5
1 \chapter{A tour of Mercurial: the basics}
2 \label{chap:tour-basic}
3
4 \section{Installing Mercurial on your system}
5 \label{sec:tour:install}
6
7 Prebuilt binary packages of Mercurial are available for every popular
8 operating system. These make it easy to start using Mercurial on your
9 computer immediately.
10
11 \subsection{Linux}
12
13 Because each Linux distribution has its own packaging tools, policies,
14 and rate of development, it's difficult to give a comprehensive set of
15 instructions on how to install Mercurial binaries. The version of
16 Mercurial that you will end up with can vary depending on how active
17 the person is who maintains the package for your distribution.
18
19 To keep things simple, I will focus on installing Mercurial from the
20 command line under the most popular Linux distributions. Most of
21 these distributions provide graphical package managers that will let
22 you install Mercurial with a single click; the package name to look
23 for is \texttt{mercurial}.
24
25 \begin{itemize}
26 \item[Debian]
27 \begin{codesample4}
28 apt-get install mercurial
29 \end{codesample4}
30
31 \item[Fedora Core]
32 \begin{codesample4}
33 yum install mercurial
34 \end{codesample4}
35
36 \item[Gentoo]
37 \begin{codesample4}
38 emerge mercurial
39 \end{codesample4}
40
41 \item[OpenSUSE]
42 \begin{codesample4}
43 yum install mercurial
44 \end{codesample4}
45
46 \item[Ubuntu] Ubuntu's Mercurial package is particularly old, and you
47 should not use it. If you know how, you can rebuild and install the
48 Debian package. It's probably easier to build Mercurial from source
49 and simply run that; see section~\ref{sec:srcinstall:unixlike} for
50 details.
51 \end{itemize}
52
53 \subsection{Mac OS X}
54
55 Lee Cantey publishes an installer of Mercurial for Mac OS~X at
56 \url{http://mercurial.berkwood.com}. This package works on both
57 Intel-~and Power-based Macs. Before you can use it, you must install
58 a compatible version of Universal MacPython~\cite{web:macpython}. This
59 is easy to do; simply follow the instructions on Lee's site.
60
61 \subsection{Solaris}
62
63 XXX.
64
65 \subsection{Windows}
66
67 Lee Cantey publishes an installer of Mercurial for Windows at
68 \url{http://mercurial.berkwood.com}. This package has no external
69 dependencies; it ``just works''.
70
71 \begin{note}
72 The Windows version of Mercurial does not automatically convert line
73 endings between Windows and Unix styles. If you want to share work
74 with Unix users, you must do a little additional configuration
75 work. XXX Flesh this out.
76 \end{note}
77
78 \section{Getting started}
79
80 To begin, we'll use the \hgcmd{version} command to find out whether
81 Mercurial is actually installed properly. The actual version
82 information that it prints isn't so important; it's whether it prints
83 anything at all that we care about.
84 \interaction{tour.version}
85
86 \subsection{Built-in help}
87
88 Mercurial provides a built-in help system. This invaluable for those
89 times when you find yourself stuck trying to remember how to run a
90 command. If you are completely stuck, simply run \hgcmd{help}; it
91 will print a brief list of commands, along with a description of what
92 each does. If you ask for help on a specific command (as below), it
93 prints more detailed information.
94 \interaction{tour.help}
95 For a more impressive level of detail (which you won't usually need)
96 run \hgcmdargs{help}{\hggopt{-v}}. The \hggopt{-v} option is short
97 for \hggopt{--verbose}, and tells Mercurial to print more information
98 than it usually would.
99
100 \section{Working with a repository}
101
102 In Mercurial, everything happens inside a \emph{repository}. The
103 repository for a project contains all of the files that ``belong to''
104 that project, along with a historical record of the project's files.
105
106 There's nothing particularly magical about a repository; it is simply
107 a directory tree in your filesystem that Mercurial treats as special.
108 You can rename delete a repository any time you like, using either the
109 command line or your file browser.
110
111 \subsection{Making a local copy of a repository}
112
113 \emph{Copying} a repository is just a little bit special. While you
114 could use a normal file copying command to make a copy of a
115 repository, it's best to use a built-in command that Mercurial
116 provides. This command is called \hgcmd{clone}, because it creates an
117 identical copy of an existing repository.
118 \interaction{tour.clone}
119 If our clone succeeded, we should now have a local directory called
120 \dirname{hello}. This directory will contain some files.
121 \interaction{tour.ls}
122 These files have the same contents and history in our repository as
123 they do in the repository we cloned.
124
125 Every Mercurial repository is complete, self-contained, and
126 independent. It contains its own private copy of a project's files
127 and history. A cloned repository remembers the location of the
128 repository it was cloned from, but it does not communicate with that
129 repository, or any other, unless you tell it to.
130
131 What this means for now is that we're free to experiment with our
132 repository, safe in the knowledge that it's a private ``sandbox'' that
133 won't affect anyone else.
134
135 \subsection{What's in a repository?}
136
137 When we take a more detailed look inside a repository, we can see that
138 it contains a directory named \dirname{.hg}. This is where Mercurial
139 keeps all of its metadata for the repository.
140 \interaction{tour.ls-a}
141
142 The contents of the \dirname{.hg} directory and its subdirectories are
143 private to Mercurial. Every other file and directory in the
144 repository is yours to do with as you please.
145
146 To introduce a little terminology, the \dirname{.hg} directory is the
147 ``real'' repository, and all of the files and directories that coexist
148 with it are said to live in the \emph{working directory}. An easy way
149 to remember the distinction is that the \emph{repository} contains the
150 \emph{history} of your project, while the \emph{working directory}
151 contains a \emph{snapshot} of your project at a particular point in
152 history.
153
154 \section{A tour through history}
155
156 One of the first things we might want to do with a new, unfamiliar
157 repository is understand its history. The \hgcmd{log} command gives
158 us a view of history.
159 \interaction{tour.log}
160 By default, this command prints a brief paragraph of output for each
161 change to the project that was recorded. In Mercurial terminology, we
162 call each of these recorded events a \emph{changeset}, because it can
163 contain a record of changes to several files.
164
165 The fields in a record of output from \hgcmd{log} are as follows.
166 \begin{itemize}
167 \item[\texttt{changeset}] This field has the format of a number,
168 followed by a colon, followed by a hexadecimal string. These are
169 \emph{identifiers} for the changeset. There are two identifiers
170 because the number is shorter and easier to type than the hex
171 string.
172 \item[\texttt{user}] The identity of the person who created the
173 changeset. This is a free-form field, but it most often contains a
174 person's name and email address.
175 \item[\texttt{date}] The date and time on which the changeset was
176 created, and the timezone in which it was created. (Thef date and
177 time are local to that timezone; they display what time and date it
178 was for the person who created the changeset.)
179 \item[\texttt{summary}] The first line of the text message that the
180 creator of the changeset entered to describe the changeset.
181 \end{itemize}
182 The default output printed by \hgcmd{log} is purely a summary; it is
183 missing a lot of detail.
184
185 \subsection{Changesets, revisions, and identification}
186
187 English being a notoriously sloppy language, we have a variety of
188 terms that have the same meaning. If you are talking about Mercurial
189 history with other people, you will find that the word ``changeset''
190 is often compressed to ``change'' or ``cset'', and sometimes a
191 changeset is referred to as a ``revision'' or a ``rev''.
192
193 While it doesn't matter what \emph{word} you use to refer to the
194 concept of ``a~changeset'', the \emph{identifier} that you use to
195 refer to ``a~\emph{specific} changeset'' is of great importance.
196 Recall that the \texttt{changeset} field in the output from
197 \hgcmd{log} identifies a changeset using both a number and a
198 hexadecimal string. The number is \emph{only valid in that
199 repository}, while the hex string is the \emph{permanent, unchanging
200 identifier} that will always identify that changeset in every copy
201 of the repository.
202
203 This distinction is important. If you send someone an email talking
204 about ``revision~33'', there's a high likelihood that their
205 revision~33 will \emph{not be the same} as yours. The reason for this
206 is that a revision number depends on the order in which changes
207 arrived in a repository, and there is no guarantee that the same
208 changes will happen in the same order in different repositories.
209 Three changes $a,b,c$ can easily appear in one repository as $0,1,2$,
210 while in another as $1,0,2$.
211
212 Mercurial uses revision numbers purely as a convenient shorthand. If
213 you need to discuss a changeset with someone, or make a record of a
214 changeset for some other reason (for example, in a bug report), use
215 the hexadecimal identifier.
216
217 \subsection{Viewing specific revisions}
218
219 To narrow the output of \hgcmd{log} down to a single revision, use the
220 \hgopt{log}{-r} (or \hgopt{log}{--rev}) option. You can use either a
221 revision number or a long-form changeset identifier, and you can
222 provide as many revisions as you want. \interaction{tour.log-r}
223
224 If you want to see the history of several revisions without having to
225 list each one, you can use \emph{range notation}; this lets you
226 express the idea ``I want all revisions between $a$ and $b$,
227 inclusive''.
228 \interaction{tour.log.range}
229 Mercurial also honours the order in which you specify revisions, so
230 \hgcmdargs{log}{-r 2:4} prints $2,3,4$ while \hgcmdargs{log}{-r 4:2}
231 prints $4,3,2$.
232
233 \subsection{More detailed information}
234
235 While the summary information printed by \hgcmd{log} is useful if you
236 already know what you're looking for, you may need to see a complete
237 description of the change, or a list of the files changed, if you're
238 trying to decide whether a changeset is the one you're looking for.
239 The \hgcmd{log} command's \hggopt{-v} (or \hggopt{--verbose})
240 option gives you this extra detail.
241 \interaction{tour.log-v}
242
243 If you want to see both the description and content of a change, add
244 the \hgopt{log}{-p} (or \hgopt{log}{--patch}) option. This displays
245 the content of a change as a \emph{unified diff} (if you've never seen
246 a unified diff before, see section~\ref{sec:mq:patch} for an overview).
247 \interaction{tour.log-vp}
248
249 \section{All about command options}
250
251 Let's take a brief break from exploring Mercurial commands to discuss
252 a pattern in the way that they work; you may find this useful to keep
253 in mind as we continiue our tour.
254
255 Mercurial has a consistent and straightforward approach to dealing
256 with the options that you can pass to commands. It follows the
257 conventions for options that are common to modern Linux and Unix
258 systems.
259 \begin{itemize}
260 \item Every option has a long name. For example, as we've already
261 seen, the \hgcmd{log} command accepts a \hgopt{log}{--rev} option.
262 \item Most options have short names, too. Instead of
263 \hgopt{log}{--rev}, we can use \hgopt{log}{-r}. (The reason that
264 some options don't have short names is that the options in question
265 are rarely used.)
266 \item Long options start with two dashes (e.g.~\hgopt{log}{--rev}),
267 while short options start with one (e.g.~\hgopt{log}{-r}).
268 \item Option naming and usage is consistent across commands. For
269 example, every command that lets you specify a changeset~ID or
270 revision number accepts both \hgopt{log}{-r} and \hgopt{log}{--rev}
271 arguments.
272 \end{itemize}
273 In the examples throughout this book, I use short options instead of
274 long. This just reflects my own preference, so don't read anything
275 significant into it.
276
277 Most commands that print output of some kind will print more output
278 when passed a \hggopt{-v} (or \hggopt{--verbose}) option, and less
279 when passed \hggopt{-q} (or \hggopt{--quiet}).
280
281 \section{Making and reviewing changes}
282
283 Now that we have a grasp of viewing history in Mercurial, let's take a
284 look at making some changes and examining them.
285
286 The first thing we'll do is isolate our experiment in a repository of
287 its own. We use the \hgcmd{clone} command, but we don't need to
288 clone a copy of the remote repository. Since we already have a copy
289 of it locally, we can just clone that instead. This is much faster
290 than cloning over the network, and cloning a local repository uses
291 less disk space in most cases, too.
292 \interaction{tour.reclone}
293 As an aside, it's often good practice to keep a ``pristine'' copy of a
294 remote repository around, which you can then make temporary clones of
295 to create sandboxes for each task you want to work on. This lets you
296 work on multiple tasks in parallel, each isolated from the others
297 until it's complete and you're ready to integrate it back. Because
298 local clones are so cheap, there's almost no overhead to cloning and
299 destroying repositories whenever you want.
300
301 In our \dirname{my-hello} repository, we have a file
302 \filename{hello.c} that contains the classic ``hello, world'' program.
303 Let's use the ancient and venerable \command{sed} command to edit this
304 file so that it prints a second line of output. (I'm only using
305 \command{sed} to do this because it's easy to write a scripted example
306 this way. Since you're not under the same constraint, you probably
307 won't want to use \command{sed}; simply use your preferred text editor to
308 do the same thing.)
309 \interaction{tour.sed}
310
311 Mercurial's \hgcmd{status} command will tell us what Mercurial knows
312 about the files in the repository.
313 \interaction{tour.status}
314 The \hgcmd{status} command prints no output for some files, but a line
315 starting with ``\texttt{M}'' for \filename{hello.c}. Unless you tell
316 it to, \hgcmd{status} will not print any output for files that have
317 not been modified.
318
319 The ``\texttt{M}'' indicates that Mercurial has noticed that we
320 modified \filename{hello.c}. Notice that we didn't need to
321 \emph{inform} Mercurial that we were going to modify the file before
322 we started, or that we had modified the file after we were done; it
323 was able to figure this out itself.
324
325 It's a little bit helpful to know that we've modified
326 \filename{hello.c}, but we might prefer to know exactly \emph{what}
327 changes we've made to it. To do this, we use the \hgcmd{diff}
328 command.
329 \interaction{tour.diff}
330
331 \section{Recording changes in a new changeset}
332
333 We can modify files, build and test our changes, and use
334 \hgcmd{status} and \hgcmd{diff} to review our changes, until we're
335 satisfied with what we've done and arrive at a natural stopping point
336 where we want to record our work in a new changeset.
337
338 The \hgcmd{commit} command lets us create a new changeset; we'll
339 usually refer to this as ``making a commit'' or ``committing''.
340
341 \subsection{Writing a commit message}
342
343 When we commit a change, Mercurial drops us into a text editor, to
344 enter a message that will describe the modifications we've made in
345 this changeset. This is called the \emph{commit message}. It will be
346 a record for readers of what we did and why, and it will be printed by
347 \hgcmd{log} after we've finished committing.
348 \interaction{tour.commit}
349
350 The editor that the \hgcmd{commit} command drops us into will contain
351 an empty line, followed by a number of lines starting with
352 ``\texttt{HG:}''.
353 \begin{codesample2}
354 \emph{empty line}
355 HG: changed hello.c
356 \end{codesample2}
357 Mercurial ignores the lines that start with ``\texttt{HG:}''; it uses
358 them only to tell us which files it's recording changes to. Modifying
359 or deleting these lines has no effect.
360
361 \subsection{Writing a good commit message}
362
363 Since \hgcmd{log} only prints the first line of a commit message by
364 default, it's best to write a commit message whose first line stands
365 alone. Here's a real example of a commit message that \emph{doesn't}
366 follow this guideline, and hence has a summary that is not readable.
367 \begin{codesample2}
368 changeset: 73:584af0e231be
369 user: Censored Person <censored.person@example.org>
370 date: Tue Sep 26 21:37:07 2006 -0700
371 summary: include buildmeister/commondefs. Add an exports and install
372 \end{codesample2}
373
374 As far as the remainder of the contents of the commit message are
375 concerned, there are no hard-and-fast rules. Mercurial itself doesn't
376 interpret or care about the contents of the commit message, though
377 your project may have policies that dictate a certain kind of
378 formatting.
379
380 My personal preference is for short, but informative, commit messages
381 that tell me something that I can't figure out with a quick glance at
382 the output of \hgcmdargs{log}{--patch}.
383
384 \subsection{Aborting a commit}
385
386 If you decide that you don't want to commit while in the middle of
387 editing a commit message, simply exit from your editor without saving
388 the file that it's editing. This will cause nothing to happen to
389 either the repository or the working directory.
390
391 If we run the \hgcmd{commit} command without any arguments, it records
392 all of the changes we've made, as reported by \hgcmd{status} and
393 \hgcmd{diff}.
394
395 \subsection{Admiring our new handywork}
396
397 Once we've finished the commit, we can use the \hgcmd{tip} command to
398 display the changeset we just created. This command produces output
399 that is identical to \hgcmd{log}, but it only displays the newest
400 revision in the repository.
401 \interaction{tour.tip}
402 We refer to the newest revision in the repository as the tip revision,
403 or simply the tip.
404
405 \section{Sharing changes}
406
407 We mentioned earlier that repositories in Mercurial are
408 self-contained. This means that the changeset we just created exists
409 only in our \dirname{my-hello} repository. Let's look at a few ways
410 that we can propagate this change into other repositories.
411
412 \subsection{Pulling changes from another repository}
413 \label{sec:tour:pull}
414
415 To get started, let's clone our original \dirname{hello} repository,
416 which does not contain the change we just committed. We'll call our
417 temporary repository \dirname{hello-pull}.
418 \interaction{tour.clone-pull}
419
420 We'll use the \hgcmd{pull} command to bring changes from
421 \dirname{my-hello} into \dirname{hello-pull}. However, blindly
422 pulling unknown changes into a repository is a somewhat scary
423 prospect. Mercurial provides the \hgcmd{incoming} command to tell us
424 what changes the \hgcmd{pull} command \emph{would} pull into the
425 repository, without actually pulling the changes in.
426 \interaction{tour.incoming}
427 (Of course, someone could cause more changesets to appear in the
428 repository that we ran \hgcmd{incoming} in, before we get a chance to
429 \hgcmd{pull} the changes, so that we could end up pulling changes that we
430 didn't expect.)
431
432 Bringing changes into a repository is a simple matter of running the
433 \hgcmd{pull} command, and telling it which repository to pull from.
434 \interaction{tour.pull}
435 As you can see from the before-and-after output of \hgcmd{tip}, we
436 have successfully pulled changes into our repository. There remains
437 one step before we can see these changes in the working directory.
438
439 \subsection{Updating the working directory}
440
441 We have so far glossed over the relationship between a repository and
442 its working directory. The \hgcmd{pull} command that we ran in
443 section~\ref{sec:tour:pull} brought changes into the repository, but
444 if we check, there's no sign of those changes in the working
445 directory. This is because \hgcmd{pull} does not (by default) touch
446 the working directory. Instead, we use the \hgcmd{update} command to
447 do this.
448 \interaction{tour.update}
449
450 It might seem a bit strange that \hgcmd{pull} doesn't update the
451 working directory automatically. There's actually a good reason for
452 this: you can use \hgcmd{update} to update the working directory to
453 the state it was in at \emph{any revision} in the history of the
454 repository. If you had the working directory updated to an old
455 revision---to hunt down the origin of a bug, say---and ran a
456 \hgcmd{pull} which automatically updated the working directory to a
457 new revision, you might not be terribly happy.
458
459 However, since pull-then-update is such a common thing to do,
460 Mercurial lets you combine the two by passing the \hgopt{pull}{-u}
461 option to \hgcmd{pull}.
462 \begin{codesample2}
463 hg pull -u
464 \end{codesample2}
465 If you look back at the output of \hgcmd{pull} in
466 section~\ref{sec:tour:pull} when we ran it without \hgopt{pull}{-u},
467 you can see that it printed a helpful reminder that we'd have to take
468 an explicit step to update the working directory:
469 \begin{codesample2}
470 (run 'hg update' to get a working copy)
471 \end{codesample2}
472
473 To find out what revision the working directory is at, use the
474 \hgcmd{parents} command.
475 \interaction{tour.parents}
476 To update the working directory to a particular revision, give a
477 revision number or changeset~ID to the \hgcmd{update} command.
478 \interaction{tour.older}
479 If you omit an explicit revision, \hgcmd{update} will update to the
480 tip revision, as shown by the second call to \hgcmd{update} in the
481 example above.
482
483 \subsection{Pushing changes to another repository}
484
485 Mercurial lets us push changes to another repository, from the
486 repository we're currently visiting. As with the example of
487 \hgcmd{pull} above, we'll create a temporary repository to push our
488 changes into.
489 \interaction{tour.clone-push}
490 The \hgcmd{outgoing} command tells us what changes would be pushed
491 into another repository.
492 \interaction{tour.outgoing}
493 And the \hgcmd{push} command does the actual push.
494 \interaction{tour.push}
495 As with \hgcmd{pull}, the \hgcmd{push} command does not update the
496 working directory in the repository that it's pushing changes into.
497 (Unlike \hgcmd{pull}, \hgcmd{push} does not provide a \texttt{-u}
498 option that updates the other repository's working directory.)
499
500 What happens if we try to pull or push changes and the receiving
501 repository already has those changes? Nothing too exciting.
502 \interaction{tour.push.nothing}
503
504 \subsection{Sharing changes over a network}
505
506 The commands we have covered in the previous few sections are not
507 limited to working with local repositories. Each works in exactly the
508 same fashion over a network connection; simply pass in a URL instead
509 of a local path.
510 \interaction{tour.outgoing.net}
511 In this example, we can see what changes we could push to the remote
512 repository, but the repository is understandably not set up to let
513 anonymous users push to it.
514 \interaction{tour.push.net}
515
516 %%% Local Variables:
517 %%% mode: latex
518 %%% TeX-master: "00book"
519 %%% End: