The number of functions from the set 1 2 3 15 to the set 0 1 that assign 0 to both 1 and 15 is

This chapter explains the features, technical details and syntaxes of the C programming language. I assume that you could write some simple programs. Otherwise, read "Introduction to Programming in C for Novices and First-time Programmers".

Introduction to C

C Standards

C is standardized as ISO/IEC 9899.

  1. K&R C: Pre-standardized C, based on Brian Kernighan and Dennis Ritchie [K&R] "The C Programming Language" 1978 book.
  2. C90 [ISO/IEC 9899:1990 "Programming Languages. C"]. Also known as ANSI C 89.
  3. C99 [ISO/IEC 9899:1999 "Programming Languages. C"]
  4. C11 [ISO/IEC 9899:2011 "Programming Languages. C"]
C Features

[TODO]

C Strength and Pitfall

[TODO]

Basic Syntaxes

Revision

Below is a simple C program that illustrates the important programming constructs [sequential flow, while-loop, and if-else] and input/output. Read "Introduction to Programming in C for Novices and First-time Programmers" if you need help in understanding this program.

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 #include int main[] { int sumOdd = 0; int sumEven = 0; int upperbound; int absDiff; printf["Enter the upperbound: "]; scanf["%d", &upperbound]; int number = 1; while [number sumEven] { absDiff = sumOdd - sumEven; } else { absDiff = sumEven - sumOdd; } printf["The sum of odd numbers is %d.\n", sumOdd]; printf["The sum of even numbers is %d.\n", sumEven]; printf["The absolute difference is %d.\n", absDiff]; return 0; }
Enter the upperbound: 1000 The sum of odd numbers is 250000. The sum of even numbers is 250500. The absolute difference is 500.

Comments

Comments are used to document and explain your codes and program logic. Comments are not programming statements and are ignored by the compiler, but they VERY IMPORTANT for providing documentation and explanation for others to understand your program [and also for yourself three days later].

There are two kinds of comments in C:

  1. Multi-line Comment: begins with a /* and ends with a */, and can span several lines.
  2. End-of-line Comment: begins with // and lasts till the end of the current line.

You should use comments liberally to explain and document your codes. During program development, instead of deleting a chunk of statements permanently, you could comment-out these statements so that you could get them back later, if needed.

Statements and Blocks

Statement: A programming statement is the smallest independent unit in a program, just like a sentence in the English language. It performs a piece of programming action. A programming statement must be terminated by a semi-colon [;], just like an English sentence ends with a period. [Why not ends with a period like an english sentence? This is because period crashes with decimal point - it is hard for the dumb computer to differentiate between period and decimal point!]

For examples,

int number1 = 10; int number2, number3 = 99; int product; product = number1 * number2 * number3; printf["Hello\n"];

Block: A block [or a compound statement] is a group of statements surrounded by braces { }. All the statements inside the block is treated as one unit. Blocks are used as the body in constructs like function, if-else and loop, which may contain multiple statements but are treated as one unit. There is no need to put a semi-colon after the closing brace to end a complex statement. Empty block [without any statement] is permitted. For examples,

if [mark >= 50] { printf["PASS\n"]; printf["Well Done!\n"]; printf["Keep it Up!\n"]; } if [number == 88] { printf["Got it\n"]; } else { printf["Try Again\n"]; } i = 1; while [i < 8] { printf["%d\n", i]; ++i; } int main[] { ...statements... }

White Spaces and Formatting Source Codes

White Spaces: Blank, tab and new-line are collectively called white spaces. C ignores extra white spaces. That is, multiple contiguous white spaces are treated as a single white space.

You need to use a white space to separate two keywords or tokens, e.g.,

int sum=0; double average; average=sum/100.0;

Additional white spaces and extra lines are, however, ignored, e.g.,

int sum = 0 ; double average ; average = sum / 100.0;

Formatting Source Codes: As mentioned, extra white spaces are ignored and have no computational significance. However, proper indentation [with tabs and blanks] and extra empty lines greatly improves the readability of the program, which is extremely important for others [and yourself three days later] to understand your programs. For example, the following hello-world works, but can you understand the program?

#include int main[]{printf["Hello, world!\n"];return 0;}

Braces: Place the beginning brace at the end of the line, and align the ending brace with the start of the statement.

Indentation: Indent the body of a block by an extra 3 [or 4 spaces], according to its level.

For example,

#include int main[] { int mark = 70; if [mark >= 50] { printf["You Pass!\n"]; } else { printf["You Fail!\n"]; } return 0; }

Most IDEs [such as CodeBlocks, Eclipse and NetBeans] have a command to re-format your source code automatically.

Note: Traditional C-style formatting places the beginning and ending braces on the same column. For example,

#include int main[] { int mark = 70; if [mark >= 50] { printf["You Pass!\n"]; } else { printf["You Fail!\n"]; } return 0; }

Preprocessor Directives

C source code is preprocessed before it is compiled into object code [as illustrated].

A preprocessor directive, which begins with a # sign [such as #include, #define], tells the preprocessor to perform a certain action [such as including a header file, or performing text replacement], before compiling the source code into object code. Preprocessor directives are not programming statements, and therefore should NOT be terminated with a semi-colon. For example,

#include #include #define PI 3.14159265 // DO NOT terminate preprocessor directive with a semi-colon

In almost all of the C programs, we use #include to include the input/output stream library header into our program, so as to use the IO library function to carry out input/output operations [such as printf[] and scanf[]].

More on preprocessor directives later.

Variables and Types

Variables

Computer programs manipulate [or process] data. A variable is used to store a piece of data for processing. It is called variable because you can change the value stored.

More precisely, a variable is a named storage location, that stores a value of a particular data type. In other words, a variable has a name, a type and stores a value.

  • A variable has a name [or identifier], e.g., radius, area, age, height. The name is needed to uniquely identify each variable, so as to assign a value to the variable [e.g., radius=1.2], and retrieve the value stored [e.g., area = radius*radius*3.1416].
  • A variable has a type. Examples of type are,
    • int: for integers [whole numbers] such as 123 and -456;
    • double: for floating-point or real numbers such as 3.1416, -55.66, having a decimal point and fractional part.
  • A variable can store a value of that particular type. It is important to take note that a variable in most programming languages is associated with a type, and can only store value of the particular type. For example, a int variable can store an integer value such as 123, but NOT real number such as 12.34, nor texts such as "Hello".
  • The concept of type was introduced into the early programming languages to simplify interpretation of data made up of 0s and 1s. The type determines the size and layout of the data, the range of its values, and the set of operations that can be applied.

The following diagram illustrates two types of variables: int and double. An int variable stores an integer [whole number]. A double variable stores a real number.

Identifiers

An identifier is needed to name a variable [or any other entity such as a function or a class]. C imposes the following rules on identifiers:

  • An identifier is a sequence of characters, of up to a certain length [compiler-dependent, typically 255 characters], comprising uppercase and lowercase letters [a-z, A-Z], digits [0-9], and underscore "_".
  • White space [blank, tab, new-line] and other special characters [such as +, -, *, /, @, &, commas, etc.] are not allowed.
  • An identifier must begin with a letter or underscore. It cannot begin with a digit. Identifiers beginning with an underscore are typically reserved for system use.
  • An identifier cannot be a reserved keyword or a reserved literal [e.g.,int, double, if, else, for].
  • Identifiers are case-sensitive. A rose is NOT a Rose, and is NOT a ROSE.

Caution: Programmers don't use blank character in names. It is either not supported, or will pose you more challenges.

Variable Naming Convention

A variable name is a noun, or a noun phrase made up of several words. The first word is in lowercase, while the remaining words are initial-capitalized, with no spaces between words. For example, thefontSize, roomNumber, xMax, yMin, xTopLeft and thisIsAVeryLongVariableName. This convention is also known as camel-case.

Recommendations
  1. It is important to choose a name that is self-descriptive and closely reflects the meaning of the variable, e.g., numberOfStudents or numStudents.
  2. Do not use meaningless names like a, b, c, d, i, j, k, i1, j99.
  3. Avoid single-alphabet names, which is easier to type but often meaningless, unless they are common names like x, y, z for coordinates, i for index.
  4. It is perfectly okay to use long names of says 30 characters to make sure that the name accurately reflects its meaning!
  5. Use singular and plural nouns prudently to differentiate between singular and plural variables.  For example, you may use the variable row to refer to a single row number and the variable rows to refer to many rows [such as an array of rows - to be discussed later].

Variable Declaration

To use a variable in your program, you need to first "introduce" it by declaring its name and type, in one of the following syntaxes:

SyntaxExample
// Declare a variable of a specified type type identifier; // Declare multiple variables of the same type, separated by commas type identifier-1, identifier-2, ..., identifier-n; // Declare a variable and assign an initial value type identifier = value; // Declare multiple variables with initial values type identifier-1 = value-1, ..., identifier-n = value-n;   int option;   double sum, difference, product, quotient;   int magicNumber = 88;   double sum = 0.0, product = 1.0;

Example,

int mark1; mark1 = 76; int mark2; mark2 = mark1 + 10; double average; average = [mark1 + mark2] / 2.0; int mark1; mark2 = "Hello";

Take note that:

  • In C, you need to declare the name of a variable before it can be used.
  • C is a "strongly-type" language. A variable takes on a type. Once the type of a variable is declared, it can only store a value belonging to this particular type. For example, an int variable can hold only integer such as 123, and NOT floating-point number such as -2.17 or text string such as "Hello". The concept of type was introduced into the early programming languages to simplify interpretation of data made up of 0s and 1s. Knowing the type of a piece of data greatly simplifies its interpretation and processing.
  • Each variable can only be declared once.
  • In C, you can declare a variable anywhere inside the program, as long as it is declared before used. [In C prior to C99, all the variables must be declared at the beginning of functions.] It is recommended that your declare a variable just before it is first used.
  • The type of a variable cannot be changed inside the program.
CAUTION: Uninitialized Variables

When a variable is declared, it contains garbage until you assign an initial value. It is important to take note that C does not issue any warning/error if you use a variable before initialize it - which certainly leads to some unexpected results. For example,

1 2 3 4 5 6 7 8 #include int main[] { int number; printf["%d\n", number]; return 0; }

Constants [const]

Constants are non-modifiable variables, declared with keyword const. Their values cannot be changed during program execution. Also, const must be initialized during declaration. For examples:

const double PI = 3.1415926;

Constant Naming Convention: Use uppercase words, joined with underscore. For example, MIN_VALUE, MAX_SIZE.

Expressions

An expression is a combination of operators [such as addition '+', subtraction '-', multiplication '*', division '/'] and operands [variables or literal values], that can be evaluated to yield a single value of a certain type. For example,

1 + 2 * 3 int sum, number; sum + number double principal, interestRate; principal * [1 + interestRate]

Assignment [=]

An assignment statement:

  1. assigns a literal value [of the RHS] to a variable [of the LHS]; or
  2. evaluates an expression [of the RHS] and assign the resultant value to a variable [of the LHS].

The RHS shall be a value; and the LHS shall be a variable [or memory address].

The syntax for assignment statement is:

SyntaxExample
// Assign the literal value [of the RHS] to the variable [of the LHS] variable = literal-value; // Evaluate the expression [RHS] and assign the result to the variable [LHS] variable = expression;   number = 88;   sum = sum + number;

The assignment statement should be interpreted this way: The expression on the right-hand-side [RHS] is first evaluated to produce a resultant value [called rvalue or right-value]. The rvalue is then assigned to the variable on the left-hand-side [LHS] [or lvalue, which is a location that can hold a rvalue]. Take note that you have to first evaluate the RHS, before assigning the resultant value to the LHS. For examples,

number = 8; number = number + 1;

The symbol "=" is known as the assignment operator. The meaning of "=" in programming is different from Mathematics. It denotes assignment instead of equality. The RHS is a literal value; or an expression that evaluates to a value; while the LHS must be a variable. Note that x = x + 1 is valid [and often used] in programming. It evaluates x + 1 and assign the resultant value to the variable x. x = x + 1 illegal in Mathematics. While x + y = 1 is allowed in Mathematics, it is invalid in programming [because the LHS of an assignment statement must be a variable]. Some programming languages use symbol ":=", "←", "->", or "→" as the assignment operator to avoid confusion with equality.

Fundamental Types

Integers: C supports these integer types: char, short, int, long, long long [in C11] in a non-decreasing order of size. The actual size depends on the implementation. The integers [except char] are signed number [which can hold zero, positive and negative numbers]. You could use the keyword unsigned [char|short|int|long|long long] to declare an unsigned integers [which can hold zero and positive numbers]. There are a total 10 types of integers - signed|unsigned combined with char|short|int|long|long long.

Characters: Characters [e.g., 'a', 'Z', '0', '9'] are encoded in ASCII into integers, and kept in type char. For example, character '0' is 48 [decimal] or 30H [hexadecimal]; character 'A' is 65 [decimal] or 41H [hexadecimal]; character 'a' is 97 [decimal] or 61H [hexadecimal]. Take note that the type char can be interpreted as character in ASCII code, or an 8-bit integer. Unlike int or long, which is signed, char could be signed or unsigned, depending on the implementation. You can use signed char or unsigned char to explicitly declare signed or unsigned char.

Floating-point Numbers: There are 3 floating point types: float, double and long double, for single, double and long double precision floating point numbers. float and double are represented as specified by IEEE 754 standard. A float can represent a number between ±1.40239846×10^-45 and ±3.40282347×10^38, approximated. A double can represented a number between ±4.94065645841246544×10^-324 and ±1.79769313486231570×10^308, approximated. Take note that not all real numbers can be represented by float and double, because there are infinite real numbers. Most of the values are approximated.

The table below shows the typical size, minimum, maximum for the primitive types. Again, take note that the sizes are implementation dependent.

CategoryTypeDescriptionBytes
[Typical]Minimum
[Typical]Maximum
[Typical]
Integers int
[or signed int]
Signed integer [of at least 16 bits] 4 [2] -2147483648 2147483647
unsigned int Unsigned integer [of at least 16 bits] 4 [2] 0 4294967295
char Character
[can be either signed or unsigned depends on implementation]
1    
signed char Character or signed tiny integer
[guarantee to be signed]
1 -128 127
unsigned char Character or unsigned tiny integer
[guarantee to be unsigned]
1 0 255
short
[or short int]
[or signed short]
[or signed short int]
Short signed integer [of at least 16 bits] 2 -32768 32767
unsigned short
[or unsigned shot int]
Unsigned short integer [of at least 16 bits] 2 0 65535
long
[or long int]
[or signed long]
[or signed long int]
Long signed integer [of at least 32 bits] 4 [8] -2147483648 2147483647
unsigned long
[or unsigned long int]
Unsigned long integer [of at least 32 bits] 4 [8] 0 same as above
long long
[or long long int]
[or signed long long]
[or signed long long int]
Very long signed integer [of at least 64 bits] 8 -263 263-1
unsigned long long
[or unsigned long long int]
Unsigned very long integer [of at least 64 bits] 8 0 264-1
Real Numbers float Floating-point number, ≈7 digits
[IEEE 754 single-precision floating point format]
4 3.4e38 3.4e-38
double Double precision floating-point number, ≈15 digits
[IEEE 754 double-precision floating point format]
8 1.7e308 1.7e-308
long double Long double precision floating-point number, ≈19 digits
[IEEE 754 quadruple-precision floating point format]
12 [8]    
Wide
Characters
wchar_t Wide [double-byte] character 2 [4]    

In addition, many C library functions use a type called size_t, which is equivalent [typedef] to a unsigned int, meant for counting, size or length, with 0 and positive integers.

*The sizeof Operator

C provides an unary sizeof operator to get the size of the operand [in bytes]. The following program uses sizeof operator to print the size of the fundamental types.

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 #include int main[] { printf["sizeof[char] is %d bytes.\n", sizeof[char]]; printf["sizeof[short] is %d bytes.\n", sizeof[short]]; printf["sizeof[int] is %d bytes.\n", sizeof[int]]; printf["sizeof[long] is %d bytes.\n", sizeof[long]]; printf["sizeof[long long] is %d bytes.\n", sizeof[long long]]; printf["sizeof[float] is %d bytes.\n", sizeof[float]]; printf["sizeof[double] is %d bytes.\n", sizeof[double]]; printf["sizeof[long double] is %d bytes.\n", sizeof[long double]]; return 0; }
sizeof[char] is 1 bytes. sizeof[short] is 2 bytes. sizeof[int] is 4 bytes. sizeof[long] is 4 bytes. sizeof[long long] is 8 bytes. sizeof[float] is 4 bytes. sizeof[double] is 8 bytes. sizeof[long double] is 12 bytes.

The results may vary among different systems.

*Header

The limits.h header contains information about limits of integer type. For example,

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 #include #include int main[] { printf["int max = %d\n", INT_MAX]; printf["int min = %d\n", INT_MIN]; printf["unsigned int max = %u\n", UINT_MAX]; printf["long max = %ld\n", LONG_MAX]; printf["long min = %ld\n", LONG_MIN]; printf["unsigned long max = %lu\n", ULONG_MAX]; printf["long long max = %lld\n", LLONG_MAX]; printf["long long min = %lld\n", LLONG_MIN]; printf["unsigned long long max = %llu\n", ULLONG_MAX]; printf["Bits in char = %d\n", CHAR_BIT]; printf["char max = %d\n", CHAR_MAX]; printf["char min = %d\n", CHAR_MIN]; printf["signed char max = %d\n", SCHAR_MAX]; printf["signed char min = %d\n", SCHAR_MIN]; printf["unsigned char max = %u\n", UCHAR_MAX]; return 0; }
int max = 2147483647 int min = -2147483648 unsigned int max = 4294967295 long max = 2147483647 long min = -2147483648 unsigned long max = 4294967295 long long max = 9223372036854775807 long long min = -9223372036854775808 unsigned long long max = 18446744073709551615 Bits in char = 8 char max = 127 char min = -128 signed char max = 127 signed char min = -128 unsigned char max = 255

Again, the outputs depend on the system.

The minimum of unsigned integer is always 0. The other constants are SHRT_MAX, SHRT_MIN, USHRT_MAX, LONG_MIN, LONG_MAX, ULONG_MAX. Try inspecting this header [search for limits.h under your compiler].

*Header

Similarly, the float.h header contain information on limits for floating point numbers, such as minimum number of significant digits [FLT_DIG, DBL_DIG, LDBL_DIG for float, double and long double], number of bits for mantissa [FLT_MANT_DIG, DBL_MANT_DIG, LDBL_MANT_DIG], maximum and minimum exponent values, etc. Try inspecting this header [search for cfloat under your compiler].

Choosing Types

As a programmer, you need to choose variables and decide on the type of the variables to be used in your programs. Most of the times, the decision is intuitive. For example, use an integer type for counting and whole number; a floating-point type for number with fractional part, char for a single character, and boolean for binary outcome.

Rule of Thumb
  • Use int for integer and double for floating point numbers. Use byte, short, long and float only if you have a good reason to choose that specific precision.
  • Use int [or unsigned int] for counting and indexing, NOT floating-point type [float or double]. This is because integer type are precise and more efficient in operations.
  • Use an integer type if possible. Use a floating-point type only if the number contains a fractional part.

Read my article on "Data Representation" if you wish to understand how the numbers and characters are represented inside the computer memory. In brief, It is important to take note that char '1' is different from int 1, short 1, float 1.0, double 1.0, and String "1". They are represented differently in the computer memory, with different precision and interpretation. For example, short 1 is "00000000 00000001", int 1 is "00000000 00000000 00000000 00000001", long long 1 is "00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000001", float 1.0 is "0 01111111 0000000 00000000 00000000", double 1.0 is "0 01111111111 0000 00000000 00000000 00000000 00000000 00000000 00000000", char '1' is "00110001".

There is a subtle difference between int 0 and double 0.0.

Furthermore, you MUST know the type of a value before you can interpret a value. For example, this value "00000000 00000000 00000000 00000001" cannot be interpreted unless you know the type.

*The typedef Statement

Typing "unsigned int" many time can get annoying. The typedef statement can be used to create a new name for an existing type. For example, you can create a new type called "uint" for "unsigned int" as follow. You should place the typedef immediately after #include. Use typedef with care because it makes the program hard to read and understand.

typedef unsigned int uint;

Many C compilers define a type called size_t, which is a typedef of unsigned int.

typedef unsigned int size_t;

Output via printf[] Function

C programs use function printf[] of library stdio to print output to the console. You need to issue a so-called preprocessor directive "#include " to use printf[].

To print a string literal such as "Hello, world", simply place it inside the parentheses, as follow:

printf[aStringLiteral];

For example,

printf["Hello, world\n"];Hello, world _

The \n represents the newline character. Printing a newline advances the cursor [denoted by _ in the above example] to the beginning of next line. printf[], by default, places the cursor after the printed string, and does not advance the cursor to the next line. For example,

printf["Hello"]; printf[", "]; printf["world!"]; printf["\n"]; printf["Hello\nworld\nagain\n"];Hello, world! Hello world again _Formatted Output via printf[]

The "f" in printf[] stands for "formatted" printing. To do formatted printing, you need to use the following syntax:

printf[formattingString, variable1, variable2, ...]

The formattingString is a string composing of normal texts and conversion specifiers. Normal texts will be printed as they are. A conversion specifier begins with a percent sign [%], followed by a code to specify the type of variable and format of the output [such as the field width and number of decimal places]. For example, %d denotes an int; %3d for an int with field-width of 3. The conversion specifiers are used as placeholders, which will be substituted by the variables given after the formatting string in a sequential manner. For example,

1 2 3 4 5 6 7 8 9 10 11 12 13 14 #include int main[] { int number1 = 12345, number2 = 678; printf["Hello, number1 is %d.\n", number1]; printf["number1=%d, number2=%d.\n", number1, number2]; printf["number1=%8d, number2=%5d.\n", number1, number2]; printf["number1=%08d, number2=%05d.\n", number1, number2]; printf["number1=%-8d, number2=%-5d.\n", number1, number2]; return 0; }
Hello, number1 is 12345. number1=12345, number2=678. number1= 12345, number2= 678. number1=00012345, number2=00678. number1=12345 , number2=678 .Type Conversion Code

The commonly-used type conversion codes are:

TypeType Conversion CodeType & Format
Integers %d [or %i] [signed] int
%u unsigned int
%o int in octal
%x, %X int in hexadecimal [%X uses uppercase A-F]
%hd, %hu short, unsigned short
%ld, %lu long, unsigned long
%lld, %llu long long, unsigned long long
Floating-point %f float in fixed notation
%e, %E float in scientific notation
%g, %G float in fixed/scientific notation depending on its value
%f, %lf [printf], %lf [scanf] double: Use %f or %lf in printf[], but %lf in scanf[].
%Lf, %Le, %LE, %Lg, %LG long double
Character %c char
String %s string

Notes:

  • For double, you must use %lf [for long float] in scanf[] [or %le, %lE, %lg, %lG], but you can use either %f or %lf in printf[] [or %e, %E, %g, %G, %le, %lE, %lg, %lG].
  • Use %% to print a % in the formatting string.

For example,

int anInt = 12345; float aFloat = 55.6677; double aDouble = 11.2233; char aChar = 'a'; char aStr[] = "Hello"; printf["The int is %d.\n", anInt]; printf["The float is %f.\n", aFloat]; printf["The double is %lf.\n", aDouble]; printf["The char is %c.\n", aChar]; printf["The string is %s.\n", aStr]; printf["The int [in hex] is %x.\n", anInt]; printf["The double [in scientific] is %le.\n", aDouble]; printf["The float [in scientific] is %E.\n", aFloat];

Using the wrong type conversion code usually produces garbage.

Field Width

You can optionally specify a field-width before the type conversion code, e.g., %3d, %6f, %20s. If the value to be formatted is shorter than the field width, it will be padded with spaces [by default]. Otherwise, the field-width will be ignored. For example,

int number = 123456; printf["number=%d.\n", number]; printf["number=%8d.\n", number]; printf["number=%3d.\n", number]; Precision [Decimal Places] for Floating-point Numbers

For floating-point numbers, you can optionally specify the number of decimal places to be printed, e.g., %6.2f, %8.3f. For example,

double value = 123.14159265; printf["value=%lf;\n", value]; printf["value=%6.2lf;\n", value]; printf["value=%9.4lf;\n", value]; printf["value=%3.2lf;\n", value]; Alignment

The output are right-aligned by default. You could include a "-" flag [before the field width] to ask for left-aligned. For example,

int i1 = 12345, i2 = 678; printf["Hello, first int is %d, second int is %5d.\n", i1, i2]; printf["Hello, first int is %d, second int is %-5d.\n", i1, i2]; char msg[] = "Hello"; printf["xx%20sxx\n", msg]; printf["xx%-20sxx\n", msg]; Others
  • + [plus sign]: display plus or minus sign preceding the number.
  • # or 0: Pad with leading # or 0.
C11's printf_s[]/scanf_s[]

C11 introduces more secure version of printf[]/scanf[] called printf_s[]/scanf_s[] to deal with mismatched conversion specifiers. Microsoft Visual C implemented its own versions of printf_s[]/scanf_s[] before C11, and issues a deprecated warning for using printf[]/scanf[].

Input via scanf[] Function

In C, you can use scanf[] function of to read inputs from keyboard. scanf[] uses the type-conversion code like printf[]. For example,

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 /* * TestScanf.c */ #include int main[] { int anInt; float aFloat; double aDouble; printf["Enter an int: "]; // Prompting message scanf["%d", &anInt]; // Read an int from keyboard and assign to variable anInt. printf["The value entered is %d.\n", anInt]; printf["Enter a floating-point number: "]; // Prompting message scanf["%f", &aFloat]; // Read a double from keyboard and assign to variable aFloat. printf["The value entered is %f.\n", aFloat]; printf["Enter a floating-point number: "]; // Prompting message scanf["%lf", &aDouble]; // Read a double from keyboard and assign to variable aDouble. printf["The value entered is %lf.\n", aDouble]; return 0; }

Notes:

  • To place the input into a variable in scanf[], you need to prefix the variable name by an ampersand sign [&]. The ampersand [&] is called address-of operator, which will be explained later. However, it is important to stress that missing ampersand [&] is a common error.
  • For double, you must use type conversion code %lf for scanf[]. You could use %f or %lf for printf[].
Return-Value for scanf[]

The scanf[] returns an int indicating the number of values read.

For example,

int number1 = 55, number2 = 66; int rcode = scanf["%d", &number]; printf["return code is %d\n", rcode]; printf["number1 is %d\n", number1]; printf["number2 is %d\n", number2];

The scanf[] returns 1 if user enters an integer which is read into the variable number. It returns 0 if user enters a non-integer [such as "hello"], and variable number is not assigned.

int number1 = 55, number2 = 66; int rcode = scanf["%d%d", &number1, &number2]; printf["return code is %d\n", rcode]; printf["number1 is %d\n", number1]; printf["number2 is %d\n", number2];

The scanf[] returns 2 if user enters two integers that are read into number1 and number2. It returns 1 if user enters an integer followed by a non-integer, and number2 will not be affected. It returns 0 if user enters a non-integer, and both number1 and number2 will not be affected.

Checking the return code of scanf[] is recommended for secure coding.

Literals for Fundamental Types and String

A literal is a specific constant value, such as 123, -456, 3.14, 'a', "Hello", that can be assigned directly to a variable; or used as part of an expression. They are called literals because they literally and explicitly identify their values.

Integer Literals

A whole number, such as 123 and -456, is treated as an int, by default. For example,

int number = -123; int sum = 4567; int bigSum = 8234567890;

An int literal may precede with a plus [+] or minus [-] sign, followed by digits. No commas or special symbols [e.g., $ or space] is allowed [e.g., 1,234 and $123 are invalid]. No preceding 0 is allowed too [e.g., 007 is invalid].

Besides the default base 10 integers, you can use a prefix '0' [zero] to denote a value in octal, prefix '0x' for a value in hexadecimal, and prefix '0b' for binary value [in some compilers], e.g.,

int number1 = 1234; int number2 = 01234; int number3 = 0x1abc; int number4 = 0b10001001;

A long literal is identified by a suffix 'L' or 'l' [avoid lowercase, which can be confused with the number one]. A long long int is identified by a suffix 'LL'. You can also use suffix 'U' for unsigned int, 'UL' for unsigned long, and 'ULL' for unsigned long long int. For example,

long number = 12345678L; long sum = 123; long long bigNumber = 987654321LL;

No suffix is needed for short literals. But you can only use integer values in the permitted range. For example,

short smallNumber = 1234567890; short midSizeNumber = -12345;Floating-point Literals

A number with a decimal point, such as 55.66 and -33.44, is treated as a double, by default. You can also express them in scientific notation, e.g., 1.2e3, -5.5E-6, where e or E denotes the exponent in power of 10. You could precede the fractional part or exponent with a plus [+] or minus [-] sign. Exponent shall be an integer. There should be no space or other characters [e.g., space] in the number.

You MUST use a suffix of 'f' or 'F' for float literals, e.g., -1.2345F. For example,

float average = 55.66; float average = 55.66f;

Use suffix 'L' [or 'l'] for long double.

Character Literals and Escape Sequences

A printable char literal is written by enclosing the character with a pair of single quotes, e.g., 'z', '$', and '9'. In C, characters are represented using 8-bit ASCII code, and can be treated as a 8-bit signed integers in arithmetic operations. In other words, char and 8-bit signed integer are interchangeable. You can also assign an integer in the range of [-128, 127] to a char variable; and [0, 255] to an unsigned char.

You can find the ASCII code table HERE.

For example,

char letter = 'a'; char anotherLetter = 98; printf["%c\n", letter]; printf["%c\n", anotherLetter]; anotherLetter += 2; printf["%c\n", anotherLetter]; printf["%d\n", anotherLetter];

Non-printable and control characters can be represented by so-called escape sequences, which begins with a back-slash [\] followed by a code. The commonly-used escape sequences are:

Escape SequenceDescriptionHex [Decimal]
\n New-line [or Line-feed] 0AH [10D]
\r Carriage-return 0DH [13D]
\t Tab 09H [9D]
\" Double-quote [needed to include " in double-quoted string] 22H [34D]
\' Single-quote 27H [39D]
\\ Back-slash [to resolve ambiguity] 5CH [92D]

Notes:

  • New-line [0AH] and carriage return [0dH], represented by \n, and \r respectively, are used as line delimiter [or end-of-line, or EOL]. However, take note that UNIX/Linux/Mac use \n as EOL, Windows use \r\n.
  • Horizontal Tab [09H] is represented as \t.
  • To resolve ambiguity, characters back-slash [\], single-quote ['] and double-quote ["] are represented using escape sequences \\, \' and \", respectively. This is because a single back-slash begins an escape sequence, while single-quotes and double-quotes are used to enclose character and string.
  • Other less commonly-used escape sequences are: \? or ?, \a for alert or bell, \b for backspace, \f for form-feed, \v for vertical tab. These may not be supported in some consoles.
The Header

The ctype.h header provides functions such as isalpha[], isdigit[], isspace[], ispunct[], isalnum[], isupper[], islower[] to determine the type of character; and toupper[], tolower[] for case conversion.

String Literals

A String literal is composed of zero of more characters surrounded by a pair of double quotes, e.g., "Hello, world!", "The sum is ", "".

String literals may contains escape sequences. Inside a String, you need to use \" for double-quote to distinguish it from the ending double-quote, e.g. "\"quoted\"". Single quote inside a String does not require escape sequence. For example,

printf["Use \\\" to place\n a \" within\ta\tstring\n"];Use \" to place a " within a string

TRY: Write a program to print the following picture. Take note that you need to use escape sequences to print special characters.

'__' [oo] +========\/ / || %%% || * ||-----|| "" ""Example [Literals]
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 #include int main[] { char gender = 'm'; unsigned short numChildren = 8; short yearOfBirth = 1945; unsigned int salary = 88000; double weight = 88.88; float gpa = 3.88f; printf["Gender is %c.\n", gender]; printf["Number of children is %u.\n", numChildren]; printf["Year of birth is %d.\n", yearOfBirth]; printf["Salary is %u.\n", salary]; printf["Weight is %.2lf.\n", weight]; printf["GPA is %.2f.\n", gpa]; return 0; }
Gender is m. Number of children is 8. Year of birth is 1945. Salary is 88000. Weight is 88.88. GPA is 3.88.

Operations

Arithmetic Operators

C supports the following arithmetic operators for numbers: short, int, long, long long, char [treated as 8-bit signed integer], unsigned short, unsigned int, unsigned long, unsigned long long, unsigned char, float, double and long double.

OperatorDescriptionUsageExamples
* Multiplication expr1 * expr2 2 * 3 → 6; 3.3 * 1.0 → 3.3
/ Division expr1 / expr2 1 / 2 → 0; 1.0 / 2.0 → 0.5
% Remainder [Modulus] expr1 % expr2 5 % 2 → 1; -5 % 2 → -1
+ Addition expr1 + expr2 1 + 2 → 3; 1.1 + 2.2 → 3.3
- Subtraction expr1 - expr2 1 - 2 → -1; 1.1 - 2.2 → -1.1

All the above operators are binary operators, i.e., they take two operands. The multiplication, division and remainder take precedence over addition and subtraction. Within the same precedence level [e.g., addition and subtraction], the expression is evaluated from left to right. For example, 1+2+3-4 is evaluated as [[1+2]+3]-4.

It is important to take note that int/int produces an int, with the result truncated, e.g., 1/2 → 0 [instead of 0.5].

Take note that C does not have an exponent [power] operator ['^' is exclusive-or, not exponent].

Arithmetic Expressions

In programming, the following arithmetic expression:

must be written as [1+2*a]/3 + [4*[b+c]*[5-d-e]]/f - 6*[7/g+h]. You cannot omit the multiplication symbol '*' [as in Mathematics].

Like Mathematics, the multiplication '*' and division '/' take precedence over addition '+' and subtraction '-'. Parentheses [] have higher precedence. The operators '+', '-', '*', and '/' are left-associative. That is, 1 + 2 + 3 + 4 is treated as [[[1+2] + 3] + 4].

Mixed-Type Operations

If both the operands of an arithmetic operation belong to the same type, the operation is carried out in that type, and the result belongs to that type. For example, int/int → int; double/double → double.

However, if the two operands belong to different types, the compiler promotes the value of the smaller type to the larger type [known as implicit type-casting]. The operation is then carried out in the larger type. For example, int/double → double/double → double. Hence, 1/2 → 0, 1.0/2.0 → 0.5, 1.0/2 → 0.5, 1/2.0 → 0.5.

For example,

TypeExampleOperation
int 2 + 3 int 2 + int 3 → int 5
double 2.2 + 3.3 double 2.2 + double 3.3 → double 5.5
mix 2 + 3.3 int 2 + double 3.3 → double 2.0 + double 3.3 → double 5.3
int 1 / 2 int 1 / int 2 → int 0
double 1.0 / 2.0 double 1.0 / double 2.0 → double 0.5
mix 1 / 2.0 int 1 / double 2.0 → double 1.0 / double 2.0 → double 0.5
Example
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 #include int main[] { int i1 = 2, i2 = 4; double d1 = 2.5, d2 = 5.2; printf["%d + %d = %d\n", i1, i2, i1+i2]; printf["%.1lf + %.1lf = %.1lf\n", d1, d2, d1+d2]; printf["%d + %.1lf = %.1lf\n", i1, d2, i1+d2]; printf["%d / %d = %d\n", i1, i2, i1/i2]; printf["%.1lf / %.1lf = %.2lf\n", d1, d2, d1/d2]; printf["%d / %.1lf = %.2lf\n", i1, d2, i1/d2]; return 0; }

Overflow/UnderFlow

Study the output of the following program:

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 #include int main[] { int i1 = 2147483647; printf["%d\n", i1 + 1]; printf["%d\n", i1 + 2]; printf["%d\n", i1 * i1]; int i2 = -2147483648; printf["%d\n", i2 - 1]; printf["%d\n", i2 - 2]; printf["%d\n", i2 * i2]; return 0; }

In arithmetic operations, the resultant value wraps around if it exceeds its range [i.e., overflow or underflow]. C runtime does not issue an error/warning message but produces incorrect result.

It is important to take note that checking of overflow/underflow is the programmer's responsibility, i.e., your job!

This feature is an legacy design, where processors were slow. Checking for overflow/underflow consumes computation power and reduces performance.

To check for arithmetic overflow [known as secure coding] is tedious. Google for "INT32-C. Ensure that operations on signed integers do not result in overflow" @ www.securecoding.cert.org.

Compound Assignment Operators

Besides the usual simple assignment operator '=' described earlier, C also provides the so-called compound assignment operators as listed:

OperatorUsageDescriptionExample
= var = expr Assign the value of the LHS to the variable at the RHS x = 5;
+= var += expr same as var = var + expr x += 5; same as x = x + 5
-= var -= expr same as var = var - expr x -= 5; same as x = x - 5
*= var *= expr same as var = var * expr x *= 5; same as x = x * 5
/= var /= expr same as var = var / expr x /= 5; same as x = x / 5
%= var %= expr same as var = var % expr x %= 5; same as x = x % 5

Increment/Decrement Operators

C supports these unary arithmetic operators: increment '++' and decrement '--'.

OperatorExampleResult
++ x++; ++x Increment by 1, same as x += 1
-- x--; --x Decrement by 1, same as x -= 1
Example
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 #include int main[] { int mark = 76; printf["%d\n", mark]; mark++; printf["%d\n", mark]; ++mark; printf["%d\n", mark]; mark = mark + 1; printf["%d\n", mark]; mark--; printf["%d\n", mark]; --mark; printf["%d\n", mark]; mark = mark - 1; printf["%d\n", mark]; return 0; }

The increment/decrement unary operator can be placed before the operand [prefix operator], or after the operands [postfix operator]. They take on different meaning in operations.

OperatorDescriptionExampleResult
++var Pre-Increment
Increment var, then use the new value of var
y = ++x; same as x=x+1; y=x;
var++ Post-Increment
Use the old value of var, then increment var
y = x++; same as oldX=x; x=x+1; y=oldX;
--var Pre-Decrement y = --x; same as x=x-1; y=x;
var-- Post-Decrement y = x--; same as oldX=x; x=x-1; y=oldX;

If '++' or '--' involves another operation, then pre- or post-order is important to specify the order of the two operations. For examples,

x = 5; printf["%d\n", x++]; x = 5; printf["%d\n", ++x];

Prefix operator [e.g, ++i] could be more efficient than postfix operator [e.g., i++] in some situations.

Implicit Type-Conversion vs. Explicit Type-Casting

Converting a value from one type to another type is called type casting [or type conversion]. There are two kinds of type casting:

  1. Implicit type-conversion performed by the compiler automatically, and
  2. Explicit type-casting via an unary type-casting operator in the form of [new-type]operand.
Implicit [Automatic] Type Conversion

When you assign a value of a fundamental [built-in] type to a variable of another fundamental type, C automatically converts the value to the receiving type, if the two types are compatible. For examples,

  • If you assign an int value to a double variable, the compiler automatically casts the int value to a double double [e.g., from 1 to 1.0] and assigns it to the double variable.
  • if you assign a double value of to an int variable, the compiler automatically casts the double value to an int value [e.g., from 1.2 to 1] and assigns it to the int variable. The fractional part would be truncated and lost. Some compilers issue a warning/error "possible loss in precision"; others do not.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 #include int main[] { int i; double d; i = 3; d = i; printf["d = %lf\n", d]; d = 5.5; i = d; printf["i = %d\n", i]; i = 6.6; printf["i = %d\n", i]; }

C will not perform automatic type conversion, if the two types are not compatible.

Explicit Type-Casting

You can explicitly perform type-casting via the so-called unary type-casting operator in the form of [new-type]operand. The type-casting operator takes one operand in the particular type, and returns an equivalent value in the new type. Take note that it is an operation that yields a resultant value, similar to an addition operation although addition involves two operands. For example,

printf["%lf\n", [double]5]; printf["%d\n", [int]5.5]; double aDouble = 5.6; int anInt = [int]aDouble;

Example: Suppose that you want to find the average [in double] of the integers between 1 and 100. Study the following codes:

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 #include int main[] { int sum = 0; double average; int number = 1; while [number "]; print[a, size]; printf["\n"]; } } } } void print[const int a[], int size] { int i; printf["{"]; for [i = 0; i < size; ++i] { printf["%d", a[i]]; if [i < size - 1] printf[","]; } printf["} "]; }

Program Notes:

  • [TODO]
Example: Sorting an Array using Insertion Sort

Wiki "Insertion Sort" for the algorithm and illustration. In brief, pass thru the list. For each element, compare with all previous elements and insert it at the correct position by shifting the other elements. For example,

{8,4,5,3,2,9,4,1} {8} {4,5,3,2,9,4,1} {4,8} {5,3,2,9,4,1} {4,5,8} {3,2,9,4,1} {3,4,5,8} {2,9,4,1} {2,3,4,5,8} {9,4,1} {2,3,4,5,8,9} {4,1} {2,3,4,4,5,8,9} {1} {1,2,3,4,4,5,8,9}

Insertion sort is also not efficient, with complexity of O[n2].

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 #include void insertionSort[int a[], int size]; void print[const int a[], int iMin, int iMax]; int main[] { const int SIZE = 8; int a[] = {8, 4, 5, 3, 2, 9, 4, 1}; print[a, 0, SIZE - 1]; printf["\n"]; insertionSort[a, SIZE]; print[a, 0, SIZE - 1]; printf["\n"]; } void insertionSort[int a[], int size] { int temp; int i, prev, shift; for [i = 1; i < size; ++i] { print[a, 0, i - 1]; print[a, i, size - 1]; printf["\n"]; for [prev = 0; prev < i; ++prev] { if [a[i] < a[prev]] { temp = a[i]; for [shift = i; shift > prev; --shift] { a[shift] = a[shift-1]; } a[prev] = temp; break; } } } } void print[const int a[], int iMin, int iMax] { int i; printf["{"]; for [i = iMin; i {} {1,4,5,3,2,9,4,8} {1} {4,5,3,2,9,4,8} => {1} {2,5,3,4,9,4,8} {1,2} {5,3,4,9,4,8} => {1,2} {3,5,4,9,4,8} {1,2,3} {5,4,9,4,8} => {1,2,3} {4,5,9,4,8} {1,2,3,4} {5,9,4,8} => {1,2,3,4} {4,9,5,8} {1,2,3,4,4} {9,5,8} => {1,2,3,4,4} {5,9,8} {1,2,3,4,4,5} {9,8} => {1,2,3,4,4,5} {8,9} {1,2,3,4,4,5,8,9}

Selection sort is also not efficient, with complexity of O[n2].

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 #include void selectionSort[int a[], int size]; void print[const int a[], int iMin, int iMax]; int main[] { const int SIZE = 8; int a[] = {8, 4, 5, 3, 2, 9, 4, 1}; print[a, 0, SIZE - 1]; printf["\n"]; selectionSort[a, SIZE]; print[a, 0, SIZE - 1]; printf["\n"]; } void selectionSort[int a[], int size] { int temp; int i, j; for [i = 0; i < size - 1; ++i] { print[a, 0, i - 1]; print[a, i, size - 1]; int minIndex = i; for [j = i + 1; j < size; ++j] { if [a[j] < a[minIndex]] minIndex = j; } if [minIndex != i] { temp = a[i]; a[i] = a[minIndex]; a[minIndex] = temp; } printf["=> "]; print[a, 0, i - 1]; print[a, i, size - 1]; printf["\n"]; } } void print[const int a[], int iMin, int iMax] { int i; printf["{"]; for [i = iMin; i

Chủ Đề