Basser Department of Computer Science
Departmental evaluation of the PBL trial - semester 1 1996

Tony Greening, Judy Kay and Jeff Kingston

Introduction

In 1997, this department will make dramatic changes to the CS1 course because the move to an object-oriented programming language will alter the overall approach to our teaching as well as the details of all the learning tasks of the course.

This document addresses another set of changes to our teaching that we believe have the potential to contribute to a first year course that is exciting, effective and innovative compared to traditional computer science courses.

Our earlier documents proposed a shift to teaching first year in an object-oriented language and a Problem-Based Learning (PBL) style. We will not repeat their summary of our motivations and plan. Instead, this document briefly describes how we taught in our PBL style in 1996, how we would recommend this to change in 1997 and a summary of the overall results of our semester 1 evaluation.

We are gaining an increasing sense of the timeliness of our proposal, not only from our growing dissatisfaction with the current style of teaching but also from the increasing evidence of similar activity. In the Vice Chancellor's Forum on Teaching: The First year Student Experience at this University (6 September, 1996), the Keynote Speaker, Richard James, of the University of Melbourne, reported on a CAUT commissioned project which identified major problems experienced by first year students and summarised several recommendations, including the following (paraphrased from his presentation) which are relevant here:

Our Vice-Chancellor, Professor Gavin Brown, spoke of the importance of first year teaching, shown by involving senior staff and by creating curricula that allow students to develop their particular talents as we cannot do in large lecture classes and rigidly fixed learning experiences. The shift we propose is precisely in line with his sentiments.

This department has had many fine teaching initiatives, including several at first year level, and the authors of this document take considerable pride in having spearheaded a number of these in the past. We are also delighted at our success in achieving the move to an object-oriented programming language for first year, one of the central tenets of our original proposal. We now present the outcome of our evaluation of PBL for teaching CS1.






Executive summary

We have now completed the one semester trial of Problem Based Learning (PBL). This document reports briefly on evaluation of the trial. This section summarises the evaluation outcomes. For detailed information on how the course was run, how we would recommend it be changed for full implementation and for detailed data, see the later sections of this report. In particular, for the detailed answers to the 10 questions for the Courses Committee, see the Appendix to this report. We have listened carefully to staff concerns and combining these with the specific questions that are to be answered for the Teaching Committee, we have identified two central concerns: The executive summary deals only with these two questions.

Will students, especially the weaker students, learn as well in a PBL format?

Analyses of the examination results show no statistically significant difference between the PBL students and conventionally taught cs1 students in overall performance. Details are available in Question 2 of the Appendix which currently includes two sets of analyses.

In addition, we are aware that some staff believe that weaker students will be unable to cope with PBL. We have produced a scatter diagram of the performance of all CS1 students against their TER, the only readily available measure of `ability'. In the diagram below, we show the PBL student's marks as a square and other cs1 students as a dot. There is no evidence that weaker (low TER) students perform less well in a PBL learning situation, this inspite of the fact that five of the PBL students have TERs below 70, our usual cut-off.

Can we resource PBL, especially the teaching effort?

There are two major roles in teaching PBL: the section leader and the class tutor. A section leader will be a senior member of staff and their role carries similar levels of responsibility to that of a lecturer for a module. The section leader must prepare and deliver the 1-hr class attended by all students in the section and maintain administrative details for students in the section, teach one class of approximately 20-24 students, with 5 hours contact for the first four weeks of semester, 4 hours contact for the next four weeks and thereafter 3 hours contact per week.

Overall, our calculations show the staff costs of teaching PBL are similar to those of the current cs1:

The detailed workload calculations appear in question 4 of the Appendix. We have ignored the common costs of the two systems (like the practical examinations). We have also ignored set up costs, though our experience in the 1996 trial convinces us that the set up costs for PBL will be lower for the new Blue materials. Finally, we have ignored the costs of the cs1a class in all calculations and omitted them from all analyses in this report.

Inherent in these calculations is the fact that costs predictions are harder than summarising actual costs and even that is not easy! Having said that, we conclude that costs of the two systems is comparable. We also note that if budgets are tight, PBL costs could be reduced by 560 hrs simply by having 6 in each group: this group size was quite workable this year and is widely used for PBL groups elsewhere.


















































Overview of cs1-pbl 1996

The activity in semester I may be logically divided into three phases. The first phase involved an orientation period in which students were introduced to the nature of PBL, and were required to learn a set of foundational skills in use of Unix to manage files and directories as well as for electronic communication, first programming in pascal and working in groups. This phase involved a period of three weeks, during which they were given two problems (and associated initial resources). Students were asked to create a homepage and then write a minimalist UNIX manual which would be suitable for beginning computer science students learning Pascal.

Writing a homepage engaged the students in an exciting aspect of computing, placing the Internet at their feet (something of an expectation in today's CS students). It also introduced a useful skill that we could rely on during the semester, asking students to attach reports to their home page, for example. It was also part of establishing skills in electronic communication to be used throughout the semester. Importantly, it served as an excellent way for students in the class and the tutor to learn about each other.

Writing a minimalist UNIX manual ensured that students were in a position to manipulate the environment in which they would be programming.

An important feature of this phase, in view of the fact that it occurs in a period of potentially volatile student enrolment, is that it uses fluctuating group membership; i.e., the tasks are essentially individual ones, but are tackled in a group environment which is deliberately varied in each class. This fits in well with concerns about group membership in this period, as well as providing a period when students could be introduced to the group learning environment.

Phase two consisted of a large problem in which groups were asked to choose between a (classical) cryptology scenario and one involving a video-shop database. The period lasted from week 4 to week 11, inclusive. At this stage, long term groups were established and students were required to develop and implement plans for researching, designing and coding solutions to the chosen problem. Pace was maintained by a requirement to provide reports on their progress and activities like code reviews where all members of the group and the tutor did a typical software engineering code review. This introduced a technique that is currently acknowledged as the most effective means of improving code quality, one that is not available in the conventional learning style where this activity would constitute cheating. Cryptography groups were encouraged to challenge each other by placing encrypted text on their home page as incentive for other groups to try and break them. Groups in this period were observed to be highly resilient to any adverse conditions. Note that each individual was required to take responsibility for some specific coding aspects.

Phase three was a period in which students reflected on their learning, and presented their work to the rest of the class and for assessment. The presentation consisted of reports documenting their work (which included individual reflective statements), submission of code (tarred with demo shell scripts and test data), and an actual demonstration of their product to the class (and to invited members of staff). During the lead-up to the demonstration, other activities were introduced as a basis on which their presentation might proceed; for example, cryptography groups were required to submit encryption code which was used to apply to large pieces of previously-unseen text, with the resulting pool of encrypted text being made available for them to attempt cryptanalysis. This was effectively a period in which their code was tested on further "real-world" problems, and was generally a rewarding period in which students were motivated by the opportunity to make use of the code they had written.

The practical work just described formed the motivational basis for student progress throughout the semester, and was directly supported by a weekly three-hour workshop period. A further (non-contiguous) two hour period was used for practical activities designed to facilitate the development of certain skills important to success in the main problem-based activities. This time was used to reinforce a range of useful skills, and not constrained to technical ones; for example, self- assessment activities took place in these sessions. Students were encouraged to be less teacher-reliant via a number of activities which involved working together in groups (these groups being different from those used in their problems). Technical material was often combined with this work; for example, one of their first exposures to Pascal was to be asked to do the first prac exam in small groups. This is typically PBL, in that the problem drives the learning rather than concludes it; for many, it de-mystified the programming task ahead, and strengthened their commitment to group processes as they jointly managed to work through a problem which initially they seemed to assess as being beyond their capabilities. Further support to the main problem-solving task was given in the 1 hour seminar slot used each Friday. This was sometimes used to present subject-matter - PBL is not discovery learning, and a seminar on certain aspects of coursework is just another resource that student would use. However, more often this one-hour session was used to discuss either conceptual matters which provide orientation to the task or consolidation of major concepts that had been the focus of the two-hour period of work. Such topics included things like problem solving, group work, programming style and testing strategies. Syntactical issues were not explicitly dealt with by a lecturer, although guidance was given as to which aspects students should be familiar with. For example, rather than lectures on procedures and functions, a session was devoted to modularity as a concept, and this was motivated by and linked to working as an individual in group contexts.

Modifications we recommend for 1997

Our first attempt at this may be judged as successful in terms of the criteria presented by the department; however, another success is that we have learnt how to do this better through the experience of the trial. This section documents some of the mistakes we have made and recognised.

More self-assessment tasks would be included in future programmes. We knew that this was important since many PBL courses identified this as a need. We included a number of these (such as a Pascal "checklist" which allowed students to self-assess the Pascal coverage they had achieved, short quizzes and longer take-home exams and self assessment forms for submitted work, in which they applied the marking scheme to their own submissions). These proved valuable, but in retrospect we do not feel that we extracted maximum value out of the device. As well as providing a useful mechanism for encouraging some degree of metacognition from the students, they offer a means for introducing structure to the course in a way that does not overly impose on student ownership of the learning task.

Another example of a change that would be incorporated in a 1997 implementation of PBL is to strive for a greater degree of incorporation of supplementary learning tasks. For example, we did a nice UNIX-based activity which was problem-based and students found a useful task; however, we later realised that we missed an opportunity for this to be framed in a wider context, such as program testing. A greater degree of task amalgamation would provide more realistic learning scenarios and could be used to reinforce the value of certain concepts (such as testing).

Finally, we have to emphasise that this was a first trial of PBL for a first year computer science course. Just as the shift to Blue will require a good deal of learning by staff and some blind-alleys, so this year's trial has given us considerable and valuable experience in applying PBL to first year computer science. At the very least, the administrative aspects will be improved in light of that experience. For example, the students were very confused about what was required of them: we had provided a document that gave detailed assessment criteria and we had devoted one class to having students apply the code assessment to some of their own code. But students were still unclear. We can see simple administrative and organisational ways to improve this. Similarly, we managed some of the marking rather inefficiently this year and only as the course progressed did we develop ways to give better feedback more efficiently.

Certainly, one important aspect of the approach will change in 1997, and that will be the change of language to BLUE. This is not insignificant on its (positive) impact on PBL. Our initial proposal included a proposal for change to an object-oriented language as a natural basis on which to deliver a course oriented towards groupwork (as PBL must be).

Acknowledgements

Kathryn Crawford has been part of the group working on the trial and managed the educational evaluation. She was unavailable at the time of writing this report so we could not include her as an author but want to acknowledge that she had an important role in the processes. We are also grateful to the research assistants who worked with her on various aspects of the evaluation, including some valuable qualitative evaluation. They are Sharon Dowsett, Kirsten Hogg and Melinda Holmes.

We also thank Bob Kummerfeld for his support through semester 1 and especially during the writing of this report when he took over teaching duties to permit the considerable time needed to prepare this report.

This report includes calculations by Ross Quinlan and graphs by Greg Ryan. We are grateful to them for helping in what has proved to be a large task.




























Appendix: Detailed Answers to the Questions Defined by the Courses Committee

This Appendix reports on each of the conditions that have been specified for the Department to adopt PBL for CS1 in 1997. As in the document approved by the Courses Committee, each condition is accompanied by a statement of the data to be provided. Then follows the actual data.

1. The 1996 semester 1 trial is a fair approximation of what is envisaged for 1997

We document each of the following: similar class sizes, student TER profile, staff and other resources, assessment regime

The trial was substantially similar to the envisaged 1997 course except that the overall numbers were far smaller, timetable constraints meant that additional classes were run for the 1-hr session and we made more use of the PBL-lab than in our initial proposal.

However, we should note that we were constrained to match the coursework for the current cs1 course; we believe that other changes agreed for 1997 (adoption of an O-O language and greater emphasis on practical skills of programming as discussed in recent meetings) will mean that the 1997 course will be better matched to PBL parts of the trial.

We also note that the trial operated under difficult conditions that operated against the effectiveness of the teaching. The worst problems were in the first weeks when timetable problems (including changes) caused chaos for many students. A set of quite serious problems resulted from our inexperience in designing PBL-style courses: we did not include all the right check points at the right places and we sometimes designed materials in old-style tutorial format (with students seeing them as irrelevant at the time they did them). The students also knew this was a trial and frequently compared themselves with the other class and delayed engaging with their work.

Similar class sizes

The 35 cs1pbl students attended

In summary, only the 1-hr class was different from the 1997 plan.

Student TER profile

Analyses of the two classes indicate that they do not differ significantly in TERs. Actual TER mean for PBL group is 81.2 (sd = 9.6) compared with the main class mean 80.9 (sd = 9.2). Null hypothesis is that means for both groups are the same, and this fails to be rejected at 0.05% level of significance.

Staff and other resources

The plan has groups of 100 students managed by 1 co-ordinator. The trial had 35 students in total and Tony Greening and Judy Kay shared the role of section co-ordinators. As planned for the co-ordinator, Tony also taught 1 class for the 2-hr and the 3-hr sessions (but as described below, from mid-semester he attended only the first hour of the 3-hr class). The other set of 2-hr and 3-hr sessions was taught by a cs4 student, Kevin Irwig. He was chosen so that we would have the opportunity to include casual staff in the trial. Total staff resources were affected by needing to match the requirements of the main class to study aspects like number representation that did not fit the problems we had chosen. In summary, the trial had smaller numbers overall and required additional time in preparation this year compared to a steady state implementation.

Other resources

nil to report.

Assessment regime

The detailed marking sheets are available. We consider these suitable for full class implementation.

2. PBL students learn what we intend the regular CS1 class to learn as well as the regular class does

Analysis of student' performance on the Semester 1 written and practical exams matched with regular students of similar TER who volunteered for PBL but were randomly excluded.

Analysis for a matched group of cs1 students is shown in the table below. Overall, there was no statistically significant difference on the semester 1 exam as a whole (using any of the methods of calculation). The students in the pbl trial were matched on the basis of TER, previous background and responses to a week 3 questionnaire about approaches to learning. Data entry problems meant analyses did not take account of people volunteering for pbl as intended.

The table below gives all statistics calculated for the semester 1 exam. The first column shows the page number. The second column shows Ross Quinlan's assessment of how much problem solving is assessed on that page: ** are regarded as largely problem solving and * have some problem solving.

The three right hand columns show all the statistical calculation so far done on this data. The third column gives the means and Z value (using Wilcoxon rank test - also known as the Mann-Whitney test) The fourth and fifth columns give 2-tailed t-scores for Null hypothesis: that the change to pbl-style cs1 had no effect on exam performance.

The differences between the three columns are:

Positive values occur when the non-pbl group have higher means than the pbl group. Asterisks (*) mark the cases where the differences are statistically significant.

The t values are given for two tailed t-tests. The fourth column t-scores generally have 74 degrees of freedom and this means that the two means are significantly different if |t| >= 2.00 (where |t| is the absolute value of t). The exceptions are H1 and H2, questions that were not completed by all. These have degrees of freedom of 35 and 39 respectively, so in both cases |t| >= 2.042 in order for the means to be significantly different. Thus the values near one are definitely not significant.

In the fifth column t-scores, the sample sizes were larger (as the whole of CS1 were being compared. Degrees of freedom for the total exam mark - S1E were 396. Here to be statistically significant |t| >= 1.98.

The differences marked are significant based on p>0.05.

The last row of the table shows that there was no statistically significant difference in overall exam results between the PBL group and either the matched control or the whole classes.

As already noted in Question 1, the groups do not differ significantly in TERs.

Page Problem Z t-score
Number Solving   matched all
2   0.683 0.96 0.89
3 ** 1.325 2.36 * 1.56
4 ** -0.298 -0.79 -0.52
5 ** -0.247 0.53 0.16
6   -0.583 -0.26 0.14
7   2.504* 2.57* 2.48*
8 * 0.495 -0.46 0.36
9 * -2.067* -1.47 -2.50 *
H1 ** -1.139 -0.41 0.06
H2 ** -1.772 -1.37 -1.04
overall
written exam
  0.239 0.41 0.33

On all three measures, the cs1 class performed significantly better in page 7. The question on page 7 required students to draw `an expanded diagram of a typical' nested record structure, something that had been done in lectures but was not seen by PBL students.

The third and fifth columns shows the PBL group performed significantly better on page 9.

The results for the practical examination show are:

and the actual marks awarded are Using a t-test, with null hypothesis that the means of pbl and cs1 were equal, a t-test indicates that the null hypothesis is not rejected at 0.05 level of significance.

3. PBL students are better than regular cs1 students at some additional skills that will be useful to them in later years, such as group work, problem solving, and independent learning.

We will present an analysis of examination results on problem-solving questions, and use a class-wide survey to assess attitudes to independent learning and similar skills.

Since PBL students were required to do many things unrelated to the main class assessment, these developed skills not normally part of courses until cs3. Students were required to write reports, devise their own convincing testing of code, put together code written by different students, demonstrate their work and, importantly, develop skill in planning and critically self evaluating their work and their planning. All these contribute to the generic skills that the University requires us to nurture in students.

In particular, the PBL course developed several skills our initial proposals identified as important to long term success in computer science as well as those in the University's generic skills. These include:

The novelty of these goals meant that we didn't do a perfect job of them this time but that we learnt a lot and have clear ideas of how to improve our running of the course in future. The critical point here is that the nature of PBL makes many of these skills natural and motivated and so students take them on as part of work on the problem.

4. Staff delivery cost per student is comparable

We will present our records of the actual delivery cost of both systems.

These costs do not account for the initial set up cost but do account for the continuing long term costs.

The following pair of tables show calculations for the cost of delivering cs1 in PBL-style compared to the current style for semester 1. It largely follows the original proposal submitted to the courses committee but has been amended to reflect our experience this year.

Some terminology needs clarification:

the tables omit aspects that are constant over both systems:







Roles Description PBL Costs in hours
    Cost Calculation Cost
Section leader for 100-120 students
5 sections for cs1 class (exclude cs1a)
1-hr/wk class
4-hrs/wk prep/admin
mark group demos at end semester
5 hrs/week * 14 weeks
+
5
= 75hrs/section/semester
375
Class tutors each 20-24 students,
for 5 classes,
including one by the section leader
and 5 sections
1-hr/wk prep (except for 7 repeat classes)
2-hrs/wk class
3-hr class
tutor present for
3-hrs, 1st 4 weeks
2-hrs, next 4 weeks
1-hr, 6 weeks
Cost of 1 class/semester=
1 hr/wk * 14 = 14 prep
2 hrs/wk * 14 = 28 2-hr class
6 hrs/wk * 4 weeks + 5 hrs/wk * 4 weeks + 4 hrs/wk * 6 weeks = 68 3-hr class
= 68 hrs/semester/class

110 hrs * 5 classes/section * 5 sections = 2750 hrs
less 7 repeat classes * 14 weeks = 98 hrs

2652
Marking groups reports 20 groups per section @ 15 mins/report
marking one draft and final
20groups * 0.25hrs * 5 Sections
* 2 markings
50
Marking individual reports 500 students (assumes no dropouts)
marking one draft and final
@ 15 mins
500 students * 0.25hrs/report
* 2 markings
250
Overall co-ordination
design assessment schemes
standardise marking
write examination
chair exam meeting
estimate 0.1 per semester 15 hrs per 0.01 load 150
Grand total for these aspects     3477

Those who have lectured cs1, giving 3 lectures per day, 3 days per week for just over 4 weeks will attest that this is an onerous load. This would essentially be replaced by the role of Section leader who will take the 1-hr Section's combined class plus one Class.

Roles Description CS1 costs as currently taught
    Cost Calculation Cost
Lecturers 3 * 0.12 load
ignore new lecturer load
3 lecturers * 12 * 15 hrs per 0.01 load = 540 540
Practical classes
start 31 classes
27 from mid-semester


Extra help
weeks 1-3

1-hr/wk prep (except for 11 repeat classes)
1-hr/wk meeting
1-hr/wk tute
2-hrs/wk lab

Extra help

5 hours * 31 classes * 7 weeks = 1085
- 11 repeat classes * 7 weeks = 77
= 1008 hrs for 1st half semester
+
5 hours * 27 classes * 7 weeks = 945
- 9 repeat classes * 7 weeks = 63
= 882

31groups * 2hrs * 3 weeks = 366 hrs

2333
Marking Asst 1
stage 2, 1hr/group
stage 3, 1hr/group
Asst 2
4hrs per bundle
21 bundles
31 +
31 +
84
146
Practical work
automatic grading
0.1 load for a full year 75 hrs 75
Develop tutes, labs
assignments and workbook
0.2 load for a full year 150 hrs for one semester 150
Co-ordination currently
by CS1 Director
to be devolved to section leaders and
person setting exam etc
Allow 0.1 of current cs1 director load 150 hrs 150
Grand total     3244


5. Space and equipment costs will not exceed lg41/2/5 plus two PBL labs like F46.

In fact this has already been demonstrated in experiments conducted by Tim Nicholson. It is a tight fit and CS2 would no longer be able to use LG41--5 for Computer Systems laboratories in Semester 1, raising a serious question about possible overcrowding of LG29 by CS2 and CS3 students. The PBL lab worked well with 8 machines for 20+ students and coped well with equipment failures. This contrasts with the conventional classes where each student needs their own terminal. Although we consider it desirable to have PBL style labs for all our prctical sessions, we have been able to use a conventional tutorial room for the 2-hr classes.

6. Students are not disadvantaged by changes in group membership.

We will present our records of group changes with an analysis of the relative performance of students in changing and constant groups.

There were no changes in group membership. This might be attributed to the process we used to form groups and the way the running made it manageable to maintain groups. We note that some groups had students who suffered illness and misadventure and there was great diversity of background, commitment, competence and performance.

Moreover, there were many situations were the groups meant that students were resilient through various difficulties. For example, in one class two of the eight monitors failed: groups just reformed where in a mainstream class, a similar loss would have been a major problem. Similarly, some students had serious personal problems and the group members helped them get back on track.

Note that groups for the main programming task were not formed until week 4 of semester. before that, we encouraged groups that changed membership for each main activity. This was done primarily because there is so much change in the cs1 class in the first few weeks, typically with around 10% of starters leaving and a similar number of late people coming.

7. The course can be fairly assessed.

e.g. without excessive reliance on discretionary marks given by non-staff tutors. No detailed proposal for assessment accompanied the original PBL proposal; this deficiency will be rectified.

The assessment was:

Description Marks %
Semester 1 Written Exam 50
Individual practical work 25
Group practical work 25

Individual practical work was assessed on a portfolio containing the work done by the individual (code, testing etc) as part of the group task. This was assessed much like any other programming assignment might be.

The groups work was assessed from a sequence of reports through the semester and at the end plus a demonstration in the last week. Both these were assessed by the section leader. In a full implementation this would mean that all the group assessment was done by one person for each Section of 100 students.

Full details of the assessment criteria and marks as used are available.

8. Students do not dislike PBL

We will present an analysis of dropouts from the two systems, including a survey of students who drop out, plus the results of surveys of attitudes to the two systems.

Studies of students who dropped out of the PBL class.

During semester 1, there were three cs1-pbl dropouts identified and interviewed by research assistants. The report from the research assistant is:

9. Sufficient staff members a willing to participate

We will spell out the various staff positions under PBL, propose a workload formula for each, and gather expressions of interest.

The roles in teaching PBL are:

To assess staff willingness to participate, an email survey was conducted starting 31st July by Judy Kay. The results are in the table on the next page: The table below summarises lecturer indications of keen/willing-ness to teach pbl/non. People who had not responded after several weeks were mailed again. Note that some people's responses to this mail survey were hard to interpret and this table represents Judy's best efforts to elicit replies from all and interpret them as intended.

It is hard to miss the fact that most of the staff who have had a long term commitment to teaching programming in first year are `keen' to teach using PBL. Only two staff members report being `keen' to teach cs1 in the non-PBL format and one other shows willingness only to teach non-pbl style.

We conclude that the 12 staff already indicating willingness (or keen) are sufficient to run pbl in sem1 1997, with no more casuals than at present.

It was clear from the return mail that several staff do not feel that they know enough about PBL to comment or are awaiting the outcomes of the trial.

We conclude that there is a substantial part of the staff who are keen to teach using PBL and a significant group that needs more information.

10. Other conditions

Other conditions may well suggest themselves, but adequate notice must be given if they require data collection.

The only new matters to present are:

Name Keen Willing notes
  PBL non PBL non notes
allan    yes  
bob yes     
doan   yesyes  
fekete yes  yes  semester 2 only
feng      awaiting more information
ian  yesyesyes  
jeff yes  yes yes  
jimd      awaiting more information
johnr yes     
judy yes yesyes  
michaelw      awaiting trial outcome
michaelh yes     
mik yes yesyes  
nitin   yesyes  
quinlan  yes    
sevinc  yes    
shopwood yes yesyes  
symvonis   yesyes from 1999
tony yes     
wobcke   yesyes awaiting trial outcome