Correlation Formula CORRELATION= is a measure of relationship between two variables. Coefficient of correlation determine validity, reliability and objectivity of an examination prepared. It also indicates the amount of agreement or disagreement between groups of scores, measurements, or individuals. Interpretation of Ranges + 0.00 to + 0.20 –Slight correlation, almost negligible relationship + 0.21 to + 0.40 –Slight correlation, definite but small relationship + 0.41 to + 0.70 –moderate correlation, substantial relationship + 0.71 to + 0.90 –High correlation, marked relationship + 0.91 to + 1.00 –Very high correlation, very dependable relationship Coefficient of correlation Spearman’s Formula: R=1-[(6(ΣG)/N2-1] Where: G=Difference of the two ranked scores N=Number of scores Procedure: 1. Write the scores or measures of the two variable under column x and column y 2. Rank the scores under column x , with the highest score as rank 1 and the lowest score as rank N. Write the ranks of the scores under column Rx which means rank of x 3. Rank the scores under column y with the highest scores as rank 1 and the lowest score as rank N. 4. Subtract the Ry values from the Rx values. Write the difference under column G, means gain. Consider only the positive values. Coefficient of correlation by the use of the Rank-Difference Method: rho=1-[(6(ΣD2)/N(N2-1)] Procedure: 1. Follow the same steps from 1 to 3 in the Spearman’s Formula. 2. Find the difference between the two steps of ranks or values under column Rx and Ry. Subtract the larger value from the smaller value. 3. Write the difference of Rx and Ry under column D, which means difference. 4. Square the difference, D and write under column D2. 5. Get the sum of the values under D2. Coefficient of correlation by the Product-Moment Method: rxy=Σdxdy/square root[(Σd2x)(Σd2y)] Procedure: 1. Get the total of the data under test x and test y and find the mean of x and mean of y. 2. Get the deviations dx and dy by getting the difference between the mean and the scores. 3. Square dx to obtain d2x and dy to obtain d2y. 4. Get the summation of each. 5. Get the product of dx and dy to have dxdy 6. Get the summation of dxdy. Posted by Statistics_Lecture at 9:10 AM 0 comments Labels: correlation
Monday, February 2, 2009 Set Operations 1. With any two sets “A” and “B” there is associated a third set “C” satisfying the property that C = { X/X Є AV x Є B } In words: “C” is equal to X, such that X is belong to “A” or X is an element of B “C” is called the union of “A” and “B” we denote the set C symbolically as C = A U B Example: A = {3, 4, 5, 6, 7} B = {2, 4, 6, 8, 10} A U B = {2, 3, 4, 5, 6, 7, 8, 10} 2. With any two sets A & B there is associated A third set “D” satisfying the property that D = { X/X Є A ۸ X Є B} In words: D equals X such that X is an element of set “A” and x is a member of B. “D” is called the intersection of sets A and B, and we denote the set D symbolically as D =AB 3. With any two sets A and B there is associated A third set “C” satisfying the property that C = { X/X Є A ۸ X € B}. We denote the set symbolically as C = A – B, and call C the relative complement or difference of A and B. Example: A = { a, b, c, d, e, f} B = { a, e, i, o, u} A – B = {b, c, d, f} and B – A = {i, o, u} 4. If A is a subset of U, then the set of an elements contained in U that are not elements of A is called the complement of A in U and is designated by Ă then Ă = {X/X Є U ۸ X € A} Example: Consider the universal set of an counting nos. and the set A of counting numbers less than 100 then U = {1, 2, 3, 4, ……} A = { 1, 2, 3,……99} Ă = {100, 101, 103…….} 5. The set product or cartesian product of two sets A and B is the set of an possible ordered pairs (a, b) where a is in A and b is in B. We symbolize this set of ordered pairs
by A X B and write, A X B = {(a,b) / a Є A ۸ b Є B } Example: If A = {1, 2} and B = {x, y} then A X B = { (1, x), (1, y), (2, x) , (2, y}} and B X A = {(X, 1), (X, 2), (Y, 1), (Y, 2)} Posted by Statistics_Lecture at 10:35 PM 1 comments Labels: set operations
Tuesday, January 27, 2009 Kinds of set 1. Finite set – countable Example: Sets A, B, C, D are finite sets 2. Infinite set – uncountable Example: Set E is an infinite set 3. Empty or null set – has no element Example: A = { } 4. Equal set – set A and set B are equal set if the elements of set A is exactly the element of set B. Example: A = {set of an even counting number of one digit} = {2,4,6,8} B = {set of an integral multiples of two having one digit = {2,4,6,8} 5. Equivalent set – two sets are equivalent if there exists a one-to-one correspondence between elements of the two sets. Example: A = {1, 2, 3, 4,5} - x coordinate B = {6, 7, 8, 9, 10} – y coordinate then “A” is equivalent to B. We can construct the relation of set A and set B. { (1,6}, (2,7), (3,8), (4,4), (5,10) } 6. Subset – set whose elements are members of the given set A = {1,2,3,4,5,8}, B = {2,4,8} 7. Universal Set – totality of the given set with consideration. The set from which we select elements to form A given set is called universal. Example: Set A = {1, 2, 3, 4, 5, 8} is a universal set
Set B = {2, 4, 8} is a subset of set A 8. Disjoint Set – sets that has no common element ; if two sets have no element in common, the sets are called disjoint sets. Posted by Statistics_Lecture at 5:03 PM 0 comments Labels: kinds of set
Friday, January 23, 2009 Methods of Writing Set
Example: Roster Method and Rule Method Methods of Writing Set 1. Roster or tabular method The elements of the set are enumerated and separated by comma. 2. Rule method or set builder A, descriptive phrase is used to describe the elements of the set Posted by Statistics_Lecture at 6:03 PM 0 comments Labels: roster method, set builder
Monday, December 29, 2008 Sets Definition and Examples Set
Definition: Set is a well-defined collection of things or objects Note: Sets maybe denoted by capital letters such as A,B,X, Y An element or member of a set is a thing that belongs to the set and maybe denoted by small letters such as a,b,c……..x,y. The members of the set are enclose in braces { }, with a comma separating the members. Example: The set “A” whose members are ETHEL, CYNTHIA, CHELO, we usually use the symbol. A = {ETHEL, CYNTHIA, CHELO} ETHEL Є A - Read as ETHEL is an element of set A - Read as ETHEL belongs to set A - Read as ETHEL is a member of set A Posted by Statistics_Lecture at 8:56 PM 0 comments Labels: set
Wednesday, December 17, 2008 Empty Set and Set - A set is a collection of things - An element or member of a set is a thing that belongs to the set. * There are many words which we use in everyday language that have the same meaning as the word set. Example: 1. A herb of cattle is a set of cattle 2. A flock of birds is a set of birds 3. A squadron of planes is a set of planes 4. a school of fish is a set of fish 5. A regiment of soldiers is a set of soldiers * The members of the set are enclosed in braces, { }, with a comma separation the members and to identify sets we often name them by capital letters.
Example: 1. The Set “A” whose members are Ethel, Emerson and Merecel. We usually use the symbol A = {Ethel, Emerson, Merecel} 2. The Set “B” of days of the week B = {Sunday, Monday, Tuesday, Wednesday, Thursday, Friday, Saturday} The set C of words to distinguish two faces of a coin C = {Heads, Tails} * The set that has no elements is called the empty set, we use the symbol Ǿ to indicate the empty set. Example of Empty set: 1. the set of whole numbers by 9 and 10. 2. the set of four-sided triangles. 3. the set of astronauts who have landed on the planet Pluto 4. the set of icebergs in the sahara desert 5. the set of people with two heads 6. the set of pink elephants 7. the set of purple cows Posted by Statistics_Lecture at 6:10 PM 0 comments Labels: empty set, set
Tuesday, December 16, 2008 Permutation Formula and Example Permutation Each different arrangement or ordered set of object is caused a permutation of those objects. - if A = {a1, a2, a3……. An} is any set of n elements then any arrangement of the elements of “A” along a line is called a permutation of the elements of A. All the permutation of the elements of the set is given by the formula: P = n! where n = no. of elements
Problem: How many permutations can be made from the word PINOY” Solution: PINOY – consist of 5 letters P = 5! = 120 permutations The total no. of permutations of n objects taken r at a time, P(n,r) is given by the expression. P (n,r) = nPr = n!/(n-r)! Problem: Find the no. of permutations of the four integers 1,2,3,4 taken two at a time. Solution: n = 4, r = 2 4P2 = n!/(n-r)! = 4!/(4-2)! = 4!/2! = 4.3.2.1 / 2.1 = 12
Statistics Probability Sample Problems 1. At a certain canteen, Doris can choose merienda from three drinks (Coke, Pepsi, Gulaman) and four sandwiches from (bacon, chicken, tuna, egg). In how many ways. Solution: D = {Coke, Pepsi, Gulaman} N(D) = 3
S = {Bacon, Chicken, Tuna, Egg} N(S) = 4 N1 . N2 = 3 x 4 = 12 ways 2. Two dice are rowed, in how many ways can they fall? If 3 dice are rowed? and if 4 dice are rowed? For two dice N1 = 6 N2 = 6 N1.N2 = 6 x 6 = 36 ways For three dice N1 . N2 . N3 6 x 6 x 6 = 216 ways For four dice N1.N2.N3.N4 6 x 6 x 6 x 6 = 296 ways 3. Using the digits 1,2,3,4,5,6, How many two-digit can be formed if a) repetition is allowed b) repetition is not allowed. How many numbers do we have to choose from the given set, they are 6 numbers. Solution: a) Repetition is allowed 6 x 6 = 36 ways b) Repetition is not allowed 6 x 5 = 30 ways Posted by Statistics_Lecture at 2:03 AM 0 comments Labels: probability, probability sample
Sunday, December 14, 2008 Statistics Probability: Definitions, Principles and Samples Probability which connotes the “chance” or the “likelihood” that something will happen or occur is an interesting and fascinating area of mathematics. Probability – the part of mathematics that deals with the questions “how likely” is called probability or the theory of probability. Probability – is a measure of certainty, its scale is from 0 to 1. A probability of zero
indicates that there is no chance at all that an event will happen or occur. A probability of one (1) indicates absolute certainty that an event will happen. Absolute certainly rarely happens in lifes. 1. Experiment Activity that can be done repeatedly. Examples: 1. Tossing a coin 2. Rolling a pie 2. Sample Space – set of all possible outcomes in an experiment(s) Examples: a.) S = {H,T} n(S) = 2 b.) S = {1,2,3,4,5,6} n(S_ = 6 c.) S = {Rod, Ed, Emer} n(S) = 3 3. Sample Point – an element in the sample space Examples a.) H is a sample point T is a sample point 4. Event – is a subset of sample space Example: Getting an even number when you roll a die is an event S = {1,2,3,4,5,6} E = {2,4,6} n (E) = 3 Counting Techniques N1 . N2 . N3 . N4 …..Nn (where N = event) Fundamental Principles If one thing can be done independently in N1 different ways and if a second thing can be done independently in N2 different ways and so on. Then the total number of ways in which all the things may be done in the stated order is N1 . N2 . N3 . N4 ………. Posted by Statistics_Lecture at 12:03 AM 0 comments Labels: probability, probability sample, statistics probability
Saturday, May 31, 2008 Mode Mode = is that single measure or score which occurs most frequently. When data are grouped into a frequency distribution, the crude mode is usually taken to be the midpoint of that interval which contains the largest frequency. When to use the mode: 1. When a quick and approximate measure of central tendency is all that is wanted. 2. When the measure of central tendency should be the most typical value. Finding mode from the ungrouped data: Example: 1. A set of numbers 11, 12, 13, 16, 16, 16, 19, 20 has 16 as the mode. 2. A set of numbers 45, 49, 52, 55, 58 has no mode. 3. A set of numbers 4, 4, 6, 8, 8, 8, 9, 9, 9, 10 has modes of 8 and 9 and is called bimodal. Mode of grouped data To determine the mode of grouped data we have to find first the modal class. In a frequency distribution, the modal class can be easily determine by inspection as it is the class with the highest frequency. Mo = Lmo + [ d1/d1 + d2 ] c Where: Lmo = lower boundery of the modal class d1 = difference between the frequency of the modal class and the frequency of the class next lower in value. d2 = difference between the frequency of the modal class and the frequency of the class next higher in value. C = class size Find the mode of example 3.8. table 3.1 Weekly wage ( in peso) f Lower class boundary P 870-899 4 869.5 900-929 6 899.5 930-959 10 929.5 960-989 13 959.5 990-1019 8 989.5 1020-1049 7 1019.5 1050-1079 2 1049.5
Mo = 959.5 + [ 3/3+5 ] 30 = P 970.75 Posted by Statistics_Lecture at 3:21 AM 0 comments
Median b.) Median = in the midpoint of the distribution. Half of the values in a distribution fall below median and the other half fall above it. When to use the median: 1. When the exact midpoint f the distribution is wanted the, 50% point. 2. When there are extreme scores which would markedly affect the mean. Extreme scores do not disturb the median. 3. When it is desired that certain scores should influence the central tendency, but all that is known about them is that they are above or below the median. - determine of whether the cases fall within the lower halve or the upper halve of a distribution (appropriate locator of central tendency). Finding the median from ungrouped data: 1. When N is odd, the median is the middle score. Ex. 20 15 13 11 8 7 6 There are 7 scores and the median is 11 2. When N is even, the median is the average of the two middle score. Ex. 21 18 15 14 11 8 8 7 There are 8 scores and the median is (14+11)/2 = 12.5 3. When several scores have the same value as the midscore. Ex. 15 15 14 11 9 9 9 6 5 Median is 9 Ex. 1. Find the median of the following set of observations. 8413&7 Sol. Array the set of observations and find the median 13478 4 is the middle item 2. Compute for the median from the following set of data 12 9 6 10 7 & 14 Array the data and computer for the median 6 7 9 10 12 14 Median = (9 + 10 )/2 = 9.5 Finding the Median from the Grouped Data Md = L + [N/2 – F2 / f2] C
Where: L = lower class boundary of the interval where the median lies N = No. of scores or sum of frequency F2 = cumulative frequency less than up to the class immediately preceding the median class (F<) f2 = frequency of the median class C= class size Steps: 1. Prepare 3 columns (Class intervals, class frequency and cumulative frequency less than) 2. Determine the Median class. The median class is that class interval where n/2 lies. 3. Substitute the data to the formula. Ex. Find the median of the frequency distribution Weekly wages,No. of Workers(f) ,F< (cumulative frequency less than) (in peso) 870 – 899, 4, 4 900 – 929 ,6, 10 930 – 959 ,10, 20 – F< 960 – 989, 13, 33 Median class 990 – 1019 ,8 , 41 1020 – 1049, 7, 48 1050 – 1079, 2, 50 N=50 To determine the median class: Solve for N/2 = 50/2 = 25th 25th items fall in 960 – 989 class interval therefore it is the median class Md = L + [(N/2 – F2)/f2] C = 959.5 + [(50/2 – 20) /13] 30 Md = Php 971.04 Posted by Statistics_Lecture at 1:44 AM 0 comments
Thursday, May 15, 2008 Measures of Central Tendency - Mean Central Tendency is the point about which the scores tend to cluster, a sort of average in the series. It is the center of concentration of scores in any set of data. It is a single number which represents the general level of performance of a group.
Three (3) measures of Central Tendency a.) Mean – The mean on arithmetic mean, or arithmetic average is defined as the sum of the values in the data group divided by the no. of values. When to use the mean 1. When the scores are distributed symmetrically around a central point. 2. When the measure of central tendency having the greatest stability is wanted. 3. When other statistics like standard deviation, coefficient of correlation, etc. are to be computer later, since these statistics are based upon the mean. Finding the Mean from Ungrouped Data Where: x = score or measure X = ∑X /N N = No. of scores or measures ∑ = summation of Example: 1.) Last year the five sales counselors of Pacific Plans Inc. sold the following number f educators plans; 24,16,35,13,25. Find the mean. Solution: X = (24+16+35+13+25)/5= 22.6 Finding the mean from Grouped Data: Long method: X = ∑f M / N Where: f = class frequency M = class midpoint N = sum of the frequencies By the “ Assumed mean” or short method : X=AM+(∑f X / N)c Where: AM = assumed mean c = class size x = deviation Example: Scores of 50 students on a college algebra test Class internal, Midpoint, fx, Class frequency f, x, fM scores
45-47 ,46 ,18 ,3 ,6, 138 42-44 ,43, 20, 4 ,5 ,172 39-41 ,40, 16, 4, 4, 160 36-38 ,37 ,12, 4, 3 ,148 33-35 ,34, 4, 2 ,2 ,68 30-32 ,31, 3 ,3 ,1 ,93 27-29 ,28 ,-0 ,13, 0,364 24-26 ,25 ,-8, 8 ,-1, 200 21-23 ,22, -6, 3, -2, 66 18-20, 19, -9, 3, -3,57 15-17, 16, 0, 0 ,-4 ,0 12-14 ,13, -10, 2, -5, 26 9-11 ,10, -6, 1, -6 ,10 34 50 1502 Long method : X = ∑f M/ N = 1502/50=30.04 Assumed mean or short method : Steps : 1. Prepare 4 column ( class interval,f,x,fx ). 2. Select the interval to contain the assumed mean (AM). For the assumed mean, one may take the midpoint of the interval near the center of the distribution, or the midpoint of the interval with the highest frequency. 3. Determine the x column starting with 0, number each class interval positive up to the highest class interval; negative up to the lowest class interval. 4. Multiply f by x to determine the fx column. 5. Find the algebraic sum of the positive fx’s and the negative fx’s to get ∑fx. Short method : X=AM+(∑f X / N)c=28+(34/50)3=30.04 Weighted Arithmetic mean or Combined mean Where: w = weight of x ∑wx = sum of the weight of ∑w = sum of the weight of x Example: The same test was administered to fourth year high school students in 3 schools. Each school had computed its own mean using internal width of 3 as shown below. School A
Class Interval Scores f, x, fx 39-41, 1, 4 ,4 36-38,2 ,3 ,6 33-35, 4 ,2, 8 30-32, 4, 1 ,4 27-29 2 0 0 24-26, 3, -1, -3 21-23 ,4 ,-2, -8 18-20,2,-3,-6 22 5 School B 42-44, 1, 6, 6 39-41, 0, 5,0 36-38, 2, 4 ,8 33-35 ,5 ,3, 15 30-32 ,6, 2, 12 27-29, 7, 1, 7 24-26 3 0 0 21-23, 4, -1, -4 18-20, 2, -2, -4 15-17 ,2, -3 ,-6 12-14 ,1, -4, -4 9-11, 2, -5, -10 35 20 School C 39-41, 1, 3, 3 36-38, 2 ,2 ,4 33-35 ,10, 1, 10 30-32, 6, 0 ,0 27-29 ,7, -1 ,-7 24-26, 2, -2, -4 21-23, 1, -3, -3 18-20, 0 ,-4, 0 15-17 ,1, -5, -5 30 -2 To find the weighted mean of the 3 schools, follow the procedures below: 1. Find the highest and the lowest scores of the schools. 2. Prepare the step intervals column for the combined distribution. 3. Write the frequencies for each steps interval for the three schools. 4. Find the total frequency for each steps interval for the total combined distribution. 5. Compute the mean from this distribution.
Class interval Scores , sch.A (f) ,sch.B (f), sch.c(f) ,total f, x, fx 42-44, 0, 1, 0, 1, 5, 5 39-41 ,1, 0 , 1, 2 ,4,8 36-38 2,2,2,6,3,18 33-35 ,4, 5 ,10 ,19, 2,38 30-32 ,4, 6 ,6, 16, 1 ,16 27-29, 2, 7 ,7, 16, 0 ,0 24-26 ,3, 3, 2, 8, -1, -8 21-23, 4, 4, 1, 9, -2 ,-18 18-20, 2, 2, 0 ,4 ,-3 ,-12 15-17, 0, 2 ,1, 3 ,-4, -12 12-14 ,0 ,1, 0 ,1 ,-5, -5 9-11 ,0, 2, 0, 2 ,-6, -2 22, 35, 30, ,18 X=AM+(∑f X / N)c=28+(18/87)3=28.6 WX = ∑wx / ∑w =22(28.69)+35(26.71)+30(30.8)/87=28.62 Posted by Statistics_Lecture at 5:12 PM 0 comments
Saturday, May 3, 2008 Graphical Method of Presenting Data and Frequency 1. Histogram. Class boundaries (x) vs. class frequency (y) 2. Frequency Polygon. Class Mark (x) vs. class frequency (y) 3. Less than ogive, upper class limit (x) vs. less than cumulative frequency (y) 4. Greater than ogive. Lower class limit (x) vs. greater than cumulative frequency (y) Posted by Statistics_Lecture at 1:31 AM 0 comments
Friday, May 2, 2008 Other Definition of Terms Array – This is the arrangement of data from the highest to lowest or from lowest to highest. Range, R - is the difference between the highest and the lowest number. Number of class- it depends on the size and nature of or class interval distribution. The no. of classes is determined into which the range will be divided. Usually, an effective no. of classes is somewhere between 4 and 20.
No.
of
classes
=
range
/
class
size
or
class
width
+1
Note: a.) If series contains less than 50 cases, 10 cases or less are just enough b.) If series contains 50 to 100 cases, 10 to 15 classes are recommended c.) If more than 100 cases, 15 or more classes are good Class Limit – the end number of a class. It is the highest and the lowest values that can go into each class. Class
Size
–
the
width
of
each
class
interval
Class Boundaries – are the “true” class limits defined by lower and upper boundaries. The lower boundaries can be determined by getting the average of the upper limit of a class and the lower limit of the next class. They can also be obtained by simply adding of a unit (0.5) to the upper limit and subtracting the same to the lower limit of each class. Class Mark, M – also known as class Midpoint. It is the average of the lower and upper limits or boundaries of each class. Class Interval – The range of values used in defining a class. It is simply the length of a class. It is the difference or distance between the upper and lower class boundaries of each class and is affected by the nature of the data and by the number of classes. It is a good practice to set up uniform class interval whenever possible for easier computation and interpretation.