How to become a programmer in 21 years: February 2008

Sunday, February 17, 2008

Inheritence and Casting

In order to go through inheritence and casting, I will use the following classes:

It is possible to create 2 objects of these classes like so:

You can view these classes like in the diagram below with the base object on the left and the derived object on the right. The derived class can be viewed as a base class with the additional member variables/functions of the derived class (of course it cannot access the private member variables of the base class object within it, but they do exist in memory).

All is well and good so far. Lovely jubbly. So now we want to copy the objects. We can either copy the baseObject to the derivedObject or vice versa. One which is fine, but the other is not.

Why is this so. Well, if you have a derivedObject and want to convert it to a baseObject, you can just strip away the extra functionality from the derived class and you are left with a base object.The value of baseMember will be of that created in derivedObject, i.e. 222.

We cannot go in reverse though as if we want to create a derived object from a base object, where is the extra derived functionality going to come from?? All we have is a plain old base object, not a derivedMember in sight....

Ok, hopefully that is nice and clear. Now if we want to create pointers that point to these objects we can do the following:

This is obviously ok, and can be viewed as such:

pBaseObject expects to point to an object of size BaseClass, and pDerivedObject expects to point to an object of size DerivedClass and whaddaya know, both of them are. All is well.

So now, lets do something crazy and switch the pointers around. Point pBaseObject to derivedObject and pDerivedObject to baseObject. Again, only one of these is ok to do, and the other will result in an error. Pointing pBaseObject to derivedObject is ok as derivedObject also contains a baseObject, so when the BaseClass pointer points to it, it can just point to the BaseClass object contained within it (as on the left in the picture below).

However, you cannot do the oppossite. If pDerivedObject tries to point to a BaseClass object it expects to point to something of size DerivedClass. Where is all of the extra derived functionality going to come from?? You cannot call setDerivedMember() because it does not exist for that object. This will result in an error.

The big difference between casting between pointers and copying between objects is that when you copy a derived object to a base one, all of the derived functionality gets sliced off and all that is left is the BaseClass object. When you point a BaseClass pointer to a DerivedClass object the entire DerivedClass object remains, you can point a different pointer to a DerivedClass back to it and it will still contain all of the functionality/member values.

So, in above, pDerivedObject2 points to the origional object with the derived class functionality intact (i.e. a derivedMember and baseMember value of 222).

The rules for casting with references is the same as for casting real objects (after all, references are just another name for the objects), Thus you can assign a BaseClass reference to a DerivedClass object, but you cannot go vice versa. Note however, that assignment in this way will not 'slice' the derived object and all of the settings/functionality of it will remain intact.

That's all for now, hope you like the pictures, they took ages!!!

--
Lurning Man
--

Thursday, February 14, 2008

More Pointers and references

I talked a little last time (in Pointers and References Basics) about, well, the basics of pointers and references. I'll go into a bit of detail now about some practical uses of them.

Probably the most useful thing about references/pointers is when you pass them to functions. If you declare a function like so:

void someFunction(SomeClass a, SomeClass b);

If you call that function like so:

SomeClass a;
SomeClass b;
someFunction(a, b);

both a and b are passed by value to the function. This means that the copy constructor is called, and copies of a and b are made which are used inside the function. This is desired behaviour and all, but there is some overhead involved in calling the copy constructors every time, especially in a performance intensive program (also, remember that the destructors will have to be called each time also...).

So, as a solution to this you can do one of two things. You can pass a pointer to the objects into the function, or you can pass a reference to the objects to the function. This will mean changing the way the function is defined. In the case of passing a pointer it means defining:

void someFnByPointer(SomeClass *pA, SomeClass *pB);

And for references it means defining it as:

void someFnByRef(SomeClass &a, SomeClass &b);

However, when calling the functions, there is a difference. To pass by pointer, you either have to create a pointer to the 2 objects being passed in, and pass that in, or pass in the address of the objects. To pass by reference, you call it as normal.

//By pointer
someFnByPointer(&a, &b);
//Or by reference
someFnByRef(a, b);

In my opinion the second way looks more natural, and you don't have to worry about making sure the pointer address is correct etc. Also in the function itself you can still use the '.' notation rather than the '->' notation. So I tend to prefer passing by reference.

Actually, there is another benefit to passing by reference/pointer (from now on, I'm just going to refer to 'passing by reference', even though it is equally applicable to pointers). When passing by reference, the passed object will not get 'sliced' if the reference is to a derived object. For example, if you specify that a base class object must be passed in by value, if you pass in a derived object, the additional functionality in the derived object gets 'sliced' off (as the copy constructor will only take the values from the base class). However, if you pass by reference the copy constructor is never called and you still have the full derived object. This is especially important if there are virtual functions floating about and you want the derived ones to be called.
So if you have a class, SomeDerivedClass that inherits from SomeClass, you can do the following and still have all of the derived class functionality (provided a derived class object is passed in):

void someFunction(SomeClass &a)
{
//if a is SomeDerivedClass object, then whole object
//remains, and the derived functionality is maintained
//so the following is fine:
SomeDerivedClass c = (SomeDerivedClass)a;
}

Finally, one last plug that should be filled regards what you can and can't do with the objects that are passed to a function. The benefit of passing by value is that when you pass an object you can forget about it, safe in the knowledge that it cannot be changed by the function. The same cannot be said if you pass by reference.

SomeClass someObjectA, someObjectB;
someFnByRef(someObjectA, someObjectB);
//The function can do anything it wants to someObjectA
//and someObjectB.

This is clearly undesirable in many cases. someFnByRef() needs to guarantee that the objects passed to it won't change. To do that, just add our old friend const when declaring the function.

void someFnByRef(const SomeClass &a, const SomeClass &b);

So basically you get all of the benefits of passing by reference (i.e. faster, no slicing etc.) with none of the drawbacks (it is still safe, cannot change the passed object). It's just like a late night infomercial!!

(Note: For an excellent discussion on this, and many other useful topics, check out Scott Meyers' 'Effective C++')

--
Lurning Man
--

Thursday, February 7, 2008

C++ Pointers and References Basics

This will talk about how references and pointers relate to each other and highlight some of their differences.

Pointers and references are quite similar in what they do, as in they both utilise an already existant object/variable. However, they have a number of big differences. Pointers are more flexible, but can be a lot more dangerous, whereas references are somewhat more restrictive, but still very useful.

Ok, so here is an example of using both of them, starting with pointers. I will use the following class for illustration:
class SomeClass
{
public:
void setValue(int i){value = i;};
private:
int value;
}

So you can declare an object of this class, and then a pointer to the object like this:
SomeClass someObject;
someObject.setValue(55);

SomeClass *pToObj; //Not pointing to anything yet
pToObj = &someObject; //Pointer is initialised now
pToObj->setValue(11); //someObject.value is now changed

Pointers can be moved around and set any amount of times, so in the above example, you could set pToObj to point to another object, or something different entirely (which is why they can be so dangerous).

You are given a lot of freedom with pointers that can lead to problems, like the following where an unnitialized pointer is operated on:

SomeClass *pToObj2;
pToObj2->setValue(12); //BAD IDEA - CRASH!!

References on the other hand are much safer. They must be initialised, can only be initialised once (as a reference to some object/variable), and once they are initialised they cannot be changed.
So you can do something like this:
SomeClass someObject;
SomeClass &someRef = someObject;

But once this is done you cannot change someRef to reference another object of SomeClass (or anything else for that matter). Note that you have to initialise a reference. If you do not then the compiler will throw out an error.

There is a difference in notation when dealing with pointers and references. When accessing member variables / functions using a pointer it is done using the arrow notation. i.e.

pToObj->memberVariable;
pToObj->memberFn();

Whereas when using references, it is done using the same '.' notation as if you were using the object itself. i.e.

someRef.memberVariable;
someRef.memberFn();

References can be viewed as an alias for an object. It's like making an object of the LeadSinger class called paulHewson, having him make a band with some of his friends, and then make a reference to the paulHewson object called bono.

LeadSinger paulHewson;
paulHewson.makeBand();
LeadSinger &bono = paulHewson;
bono.sing(); //Both bono and paulHewson sing here as they are the same object!!

paulHewson and bono are just two different names for the same object. You cannot then go on and make bono into something else:

LeadSinger noelGallagher;
bono = noelGallagher;

You can, however, change pointers to point to other objects. So the following is possible:

Drummer johnPepys;
Drummer ericChilds;
Drummer peterBond;
Drummer *pSpinalTapDrummer = &johnPepys;

johnPepys.DieInGardeningAccident();
pSpinalTapDrummer = &ericChilds; //Same pointer, different object
ericChilds.ChokeOnVomit();
pSpinalTapDrummer = &peterBond;
...
and so on
...

I think that covers the basics, I've got a fair bit more to write about pointers and references in posts to come, but that's enough for now!

--
Lurning Man
--

Tuesday, February 5, 2008

Const in C++

Const is great. Seriously it is.

I would never have used const in my pre-professional days (in fact I had never used it before a month ago). What would have been the point? It doesn't really add any functionality or speed etc. But what it does do is try to help you to ensure you don't shoot yourself in the foot, and ensure that other people don't shoot both of you in the foot in some kind of non-const foot shooting massacre.

So, in general const makes a variable or object unchangable (i.e. constant). So something like this will not work:
const int x = 5;
x++; //Error...

You can, of course do something like this:
const int x = 5;
int y = x; //copies value of const variable to a non-const.
y++; //Fine
x++; //Still an error

Const actually has a load of other applications, so I'll try go through them in some sort of logical order (and by that, I mean the order that Scott Meyers does in the highly recommended Effective C++....)

So firstly, in some cases it can be used as a replacement for #defines. This is more of a limitation in using #defines that a huge positive in using consts. The idea that instead of doing something like:
#define PI 3.14
It is possible to do this:
const double pi 3.14;

Why would you want to do this?? Well, the main reason is to stop yourself (and others) getting confused and using PI incorrectly, or more importantly catching times when you use it incorrectly. When you use PI, the value 3.14 is actually replaced in the code before the code is compiled. This means that any error messages will refer to "3.14" rather than PI (as the compiler, which generates the error messages, has no idea what PI is). Any error messages about the const pi will refer to that rather than some mysterious 3.14.

Ok, this is a fairly small reason to use const, and to be honest I never remember getting into too much trouble using a #define for a single value, but I'm prepared to bow down to Scott Mayers advice on this one and so I wanted to include it for completeness. The real meat as far as I'm concerned is still to come.

I'll now talk a little about const when declaring functions. Function declarations can put const in a number of places, as can be seen from the following function that accepts and returns an integer:

int functionName(int x)const; //(1)
int functionName(const int x); //(2)
const int functionName(int x); //(3)

And here is what each of them do:
Number (1) - When declaring a member function, if it is declared in this form then that function cannot change any of the member variables belonging to that class. It is basically like all of the member variables become const for the duration of the function. So the following will not compile:

class SomeClass{
public:
void changeValue(int x)const;
private:
int someValue;
};

void SomeClass::changeValue(int x)const
{
this->someValue = x; //Error - someValue is const for this function
}

This is very useful for member functions like 'getters' which should only read member variables and not write to them. But of course it can be used for any member function that you want to put this restriction on. Of course, careful programming would remove the necessity of this, but for the price of typing 5 letters you get the added security of knowing you won't change any member variables by mistake.

Also, note that using const in this way is only useful on non-static member functions, so using it on a static member function (or a function outside of a class) will result in a compiler error.

Ok, on to Number(2): This says that the function cannot change the value x that is passed to the function cannot be changed. This is useful if you are passing in a value and you want to be sure that it will not be changed by mistake. See the following code:

void someFunction(const int x)
{
int y = x; //Fine - const->non-const copying is fine
x = x + 1; //Error - cannot assign to a variable that is const
}

Finally, there is Number(3): The first time I saw this I thought - "This must mean that when you return the const int from functionName(), that the return value cannot be changed". But this is wrong, wrong, wrong.... and when I thought about it it makes sense. What is actually happening when you return a variable from a method by value (as we are doing here)? Well, we are essentially copying the value from within the function to the value outside it. And copying a const value to a not const value is fine.

So, in the form that it is on, statement (3) doesn't really do anything. i.e. this is perfectly legal;

const int someFunction()
{
const int x = 5;
return(x);
}
int main(int argc, char* argv[])
{
int y = someFunction();
y++; //fine.
}

This is basically the same as:

const int constInt = 5;
int normalInt = constInt; //Fine, copying the value
normalInt++; //Fine

Where declaring the return type const is only really useful if the return value is going to be directly operated on, i.e. when returning references or pointers. So, for example, if you had a class set up as so:

class SomeClass{
public:
const int& getTimesCalled(){return(SomeClass::numTimesCalled);};
int& getTimesCalledNonConst(){return(SomeClass::numTimesCalled);};
private:
static int numTimesCalled;
};
int SomeClass::numTimesCalled = 0;

With a const and non-const returned reference to numTimesCalled, the following behaviour is observed:

int main(int argc, char* argv[])
{
SomeClass someClass;
int retVal = someClass.getTimesCalled(); //Legal - the value of numTimesCalled
//gets copied to retVal
retVal++; //Legal, retVal gets incremented by 1, numTimesCalled uneffected.

retVal = someClass.getTimesCalledNonConst(); //Same as above
retVal++; //Same as above

int &refRetVal = someClass.getTimesCalled(); //Illegal, cannot assign to a
//non const reference...

int &refRetVal2 = someClass.getTimesCalledNonConst(); //Legal
refRetVal2 ++; //Legal, but changes the value numTimesCalled (in this
//case undesired)

const int &refRetValConst = someClass.getTimesCalled(); //Legal
refRetValConst ++; //Illegal, changing assign to a variable that is const.
}

So, in this case the const return value is preffered as it stops outsiders changing the numTimesCalled variable.

It also has the added advantage of stopping nasty behaviour like this:

someClass.getTimesCalledNonConst()++; //Whoa, changes
//numTimesCalled, definitely undesired!!!

someClass.getTimesCalled()++;//Illegal cannot assign to a const (desired)

See I told you const was great!! That's all for now.

--
Lurning Man
--

Monday, February 4, 2008

First post

Wow, first (and possibly last, I'm pretty lazy...) post. This is exciting.....

Ok, this is the most selfish reason ever to create a blog. I'm not one of these guys that wants to give something back to society through the medium of blogging or anything. Nuts to that. This is all about me!

So, first the back story, and then I'll let you in on the secret of the blog (or you could just read the "About me" part on the right, but that would spoil everything...)

I've been programming for a while, I did electronic engineering in college, so did bits of C and Java there (and some other stuff of course, but this is about programming). Anyway I didn't really feel like I'd had my fill of college after the degree and so I went on to do a PhD. Doing a PhD was great, I'd recommend it to anyone with an interest in research.

However, in terms of learning to program, academia is not the place to be, at least in the area that I did my PhD (digital video analysis). I mean, I did a lot of programming (mostly in a hybrid of C and C++), but you only ever do enough to get the job done. In research, all you ever want to do is prove an idea works. In general it doesn't matter if it works efficiently or not, as long as it works. So you get a good idea of how to solve problems etc. but it is usually in a "I'll fiddle around with this until it works" kind of way. If it crashes every second time who cares? If it is the single most inefficient thing ever created, well, what's the problem with that?

Even in the processor intensive area of digital video analysis, where you are often processing gigabytes of data, it largely doesn't matter how long the processing takes. Program running slow?? It doesn't matter, just leave it running overnight. It's 3 in the afternoon for God's sakes, there are bars to go to. Just leave the program going and you can get back to it tomorrow.

And therein lies the problem. For, I have now ventured into the real world of software creation. And what a big bad world it is!!

I must admit, when I decided to leave academia I kinda thought "I'll show these corporate whores a thing or two". Well, I may have been wrong on that one.... In the first couple of months of 'real work', the main thing I've learned is that I actually know very little about programming.

So, as for the secret of this blog. Well, as I said, this is a selfish blog. I aim to chronicle my journey from a basic programmer to (hopefully) an accomplished one. Whenever I find out something that I didn't know before, I'll post something up here for everyone to marvel at how silly I am for not knowing it in the first place. Thus, the basic aim of this blog is to re-enforce my own learning.

It won't be pretty, but it might be useful (for me)....

How to become a programmer in 21 years