Informatica/SQL/UNIX/Data warehouse questions:
1. what is the difference between source qualifier and joiner?
1. what is the difference between source qualifier and joiner?
Source Qualifier | Joiner |
Is used to join homogeneous database tables | Is used to join heterogeneous database tables and can also join homogeneous tables. Can also join flat files. Ex. Oracle, sql server etc. |
Is used to join n number of tables | Can join only two tables using one joiner |
Can do SQL override | Cannot do SQL Override |
Master and detail tables, master goes into cache which has less number of rows. | |
Different types of joins are Master outer, detail outer, full outer and Normal join. |
2. what is the difference between lookup and joiner?
Lookup | Joiner |
It is passive transformation | It is active transformation |
It can use multiple relational operators | It can use only ‘=’ relational operator |
It can get either first or last row | It gets all rows if match found. |
SQL override can be done | Cannot do SQL override. |
3. what is the difference between filter and router?
Filter | Router |
Is used to give only one value for one column | Can give multiple values for one column in different groups. |
It has input and default groups. You can create as many groups as you want. |
4. what does normalizer do?
Ans : a) COBOL file sources, we get normalizer source qualifier
b) In the pipeline to break one record into multiple records.
5. what does lookup do?
Ans : a) Gets the related value.
b) Updating slowly changing dimensions.
6. what is the difference between connected and unconnected lookup?
Connected lookup | Unconnected lookup |
Gets the value directly from the pipeline | Gets the value indirectly using expression or update strategy |
Can use static and dynamic cache | Can use only static cache |
Can return multiple ports | Can return only one port ( Return port ) |
Can be called only once where specified | Can be called multiple times in a mapping. |
User defined default values can be used if no match found | User defined default values not supported if no match found. |
7. what is the difference between static and dynamic cache?
Static cache | Dynamic cache |
No change in the cache | Inserts, updates or no change in the cache before passing it onto the target table. |
Has Newlookuprow port which has values : 0 – No change 1 – Insert 2 – Update Associated ports is used for comparison or to check for any changes. |
8. what is the difference between mapping parameters and variables?
Mapping parameters | Mapping variables |
Remains constant through out the mapping | Changes through out the mapping |
Ex : Currency conversion or schema name $$curr_rate, $$schema_name | Ex : for delta pull or full pull $$Lastpulldate & $$Enddate |
No special functions used | Functions used : Setvariable Setmaxvariable setminvariable |
Session parameters :
Session parameters are used at session level and remains same through out the session. Its represented using <$sessionparametername>.
9. what is performance tuning? How do you identify the bottlenecks?
Ans : Performance tuning is used for improving the performance.One solution is to do pipeline partitioning
Steps to identify bottlenecks:
a) Look at the session log and try to find out the bug in it.
b) Target bottleneck - Change relational target to flat file target and check the time
c) Source bottleneck – run the SQL query onto the database and find out the time, if taking long time then do the SQL tuning.
d) Mapping bottlenecks – Expression transformation – Use decodes instead of multiple IIF statements, use || instead of multiple concat functions.
Aggregator transformation – Use sorted input for it.
Joiner transformation – Make sure the master table is smaller
Lookup transformation – try to add where clause in Lookup SQL override to restrict no. of rows getting in lookup cache.
10. what is pipeline partitioning?
Ans :Creating partitions means creating multiple reader, writer and transformation threads. Threads execute at the same time.
Different types of partitions are:
a) Pass through – simple Partitioning, usually on Source qualifier
b) Round robin – Balances the load on each of these partitions, usually on filter transformations
c) Key range – We define key ranges on each of the partitions, usually on source and target definitions
d) Hashkey – Places data from each group together, usually on aggregators – tell that you haven’t worked a lot on it.
We can create upto 63 partitions for a particular session .. tell them you have created around 11.
11. how do you read xml files?
Ans :
XSD – Schema definition file
XML – Actual data file It is recommended to import XSD files.
12. what is difference between star and snowflake schema?
Star schema | Snowflake schema |
It is denormalized | It is normalized |
Dimension tables are not broken into other lookup tables | Dimension tables are broken into other lookup tables. |
Easier to access data | Difficult to access the tables as lot of joins will be involved. |
13. what is SCD type 1 and type 2? How do you do the mappings?
SCD – Type 1 | SCD Type -2 | SCD Type -3 |
No History | Full history | Partial history |
If any changes, it will be updated | If any changes then new record will be inserted. Will be maintained by one of following : Version number Flag Effective dates | If any changes, new column will be added. First and last information will be stored. |
14. What is the complex mapping that you have worked on?
Ans : SCD Type – 2
15. How do you remove duplicates from a flat file?
Ans : By using aggregator and grouping by all the columns from the flat file. In order to remove duplicates from UNIX box, use sort – u option.
16. How do you find out duplicates from the table?
Ans : Example : Select empno,ename, count(*) from emp Group by empno, ename having count(*) > 1
17. What do you do if you get an error in the session?
Ans : Check the error in session log and try to find out where exactly is the error in the mapping.
18. what is the difference between rank and aggregator?
Rank | Aggregator |
Rank is active | Aggregator is also active |
It groups by but can give all the ranks within that group | It groups by and gives out only last row from that group |
Rankindex column is automatically created | Aggregate functions can be performed like: MAX MIN SUM COUNT AVG |
19. How did you do the full pull and delta load?
Ans :Full pull – read entire table as source
Delta pull – read only part of table as source, maybe only 5 days worth data.
21. How did you use to schedule the etl jobs?
Ans : Used schedulers like
Tivoli workflow scheduler – TWS
Autosys
Crontab.
22. How did you use to do SQL tuning? Do you know anything about explain plan?
22. How did you use to do SQL tuning? Do you know anything about explain plan?
Ans : Some of the steps in SQL Tuning :
a) Try to join tables including all their natural keys.
b) Try to remove subqueries
c) Try to reduce aggregations.
23. What is IDQ and IDE?
Ans : IDQ – Informatica Data Quality
IDE – Informatica Data explorer. If asked tell them that you haven’t worked a lot on it.
24. How did you use to get the files from Mainframe?
Ans : Mainframe team used to FTP the files onto our UNIX box and sometimes we used to pull those from Mainframe using Informatica Power Exchange.
25. which tables will you load first? Fact or Dimension table?
25. which tables will you load first? Fact or Dimension table?
Ans : Dimension tables will be loaded first and then fact tables.
26. what were the sizes of the fact and dimension table that you have worked on?
Ans : Dimension tables – around few thousand records
Fact tables – around 15-20 million records.
27. what are different types of joins and how do you write queries for that?
Ans : Inner join – gets only when condition matches
Left outer – gets everything from right and those only which are matching from left.
Right outer - gets everything from left and those only which are matching from right.
Full outer – gets everything from both the tables
Self join – joins with the same table again.
28. How do you denormalize data?
Ans : look at the interview diagram.
29. What is the difference between active and passive transformations?
Ans : If the number of rows going in the transformation is not equal to the number of rows going out of it then it is active. If they are same then passive transformation.
30. What does update strategy do?
Ans : Update strategy is used to flag the rows for the following operations :
DD_INSERT
DD_UPDATE
DD_REJECT
DD_DELETE
DD_UPDATE
DD_REJECT
DD_DELETE
31. what is the difference between union and union all, delete and truncate?
Ans : Union all – merges two tables with all the values
Union – merges two tables with only distinct values
Delete – can delete records which we want and we can rollback
Truncate – deletes all the records and cannot rollback.
32. what does the following commands in UNIX do?
a) ls –ltr – displays all the files with their times and permissions.
b) vi – editor to view the files and make changes in it.
c) cp, mv, rm, mkdir, cat – cp is for copy, mv is for renaming the file, rm is for removing the file, mkdir is for creating directory, cat is for viewing the file and not making changes.
d) grep – used for searching patterns in a file or multiple files.
e) sort – u - Is used for removing duplicates.
f) pwd – shows present working directory
33. what is the scheduling tool that you have been working on?
Ans : TWS ( Tivoli workflow scheduler )
34. what do you mean by CDC?
34. what do you mean by CDC?
Ans : CDC – Change data capture. Picks up only changed records from the source.
35. what did you use PowerExchange for and which version?
Ans : Extracting files from Mainframe.
36. What is the difference between MINUS and INTERSECT?
Ans :
MINUS – If A MINUS B then picks all the records from A which do not exist in B.
INTERSECT – If A INTERSECT B then picks up common records between A and B.
37. what are the new features of version 8? what is Pushdown optimization?
Ans : New transformations are added :
Java transformation
SQL transformation
HTTP transformation
Push down optimization – Pushes part of the transformation logic in informatica onto the database to improve the performance.
38. what are the transformations that use cache and which are active and which are passive?
Ans :Transformations that uses cache are : Sorter, aggregator, lookup, joiner, rank.
Passive transformations : Expression, lookup, stored procedure
Active transformations : others are all active.
39. How do you handle performance issues with aggregator and sorter?
Ans : Use sorter transformation before aggregator that means passing sorted input to aggregator may improve the performance.
40. What is versioning?
Ans :Its used to keep history of changes done on the mappings and workflows
Check in : You check in when you are done with your changes so that everyone can see those changes.
Check out : You Check out from the main stream when you want to make any change to the mapping/workflow.
Version history : It will show you all the changes made and who made it.
41. what is target load order plan?
Ans :If a mapping has two source qualifiers going to different targets, then we can define the order in which they can load
Ex : A -Ã B
B -Ã C
Mappings (tab) -Ã Target Load Order Plan
42. what are mapplets and worklets?
Ans : Mapplets are reusable codes, you define some complex logic in It and can use across different mappings in the repository. You cannot have target definition in it but can have source definition.
43. what are different types of tasks that you have worked on?
Ans:Email Task – sends out email to group after successful execution of the task.
b) Command Task – used to write some UNIX commands.
c) Session Task – associated with each mapping.
d) Decision task – Used as a router.
44. what are pmcmd commands and have you written perl scripts or shell scripts on your own? what did you do in that?
Ans :These are the UNIX commands that are used to start or stop informatica server, run workflows etc.. tell them there were predefined scripts for it which you use to run.
45. What is database partitioning? what are different types?
Ans : Database partitioning is dividing table into different parts.
Different types :
a) Range partitioning
b) List partitioning
c) Hash partitioning
d) Composite partitioning
46. what are cursors, triggers, stored procedures and functions, views? and what is the difference between them?
Ans :Cursors - A cursor is a mechanism by which you can assign a name to a "select statement" and manipulate the information within that SQL statement.
We've categorized cursors into the following topics:
Triggers – They get executed at a certain event.
Stored procedures – reusable objects, may or may not have return value.
Functions – reusable objects, always have return value.
Views – Virtual table, its not physical table and gets data according to select statement.
47. what is the difference between exists and IN clause?
Ans :IN - The IN function helps reduce the need to use multiple OR conditions.
EXISTS - The EXISTS condition is considered "to be met" if the subquery returns at least one row.
48. what does the debugger do?
Ans : Debugger is used to stop at some breakpoints and look at the data flow. Helps in identifying where exactly the issue is.
49. what is difference between version 7 and version 8?
Ans : In Version 8, new transformations like Java, SQL, HTTP are added and push down optimization feature is introduced.
50. what are different types of stored procedure transformations?
Ans : Normal – connected / unconnected
Source pre load ( Unconnected ) – check existence of tables
Source post load ( Unconnected ) – drop temporary tables
Target pre load ( Unconnected ) – drop indexes on target table
Target post load ( Unconnected ) – recreate indexes on target table.
51. what are presession and post session commands?
Ans :presession commands : This can be defined in the session, you can write some sql query that will execute before the session kicks off.
postsession commands : This can be defined in the session, you can write some sql query that will execute after the session kicks off.
52. what do you do when you have problem loading targets?
Ans : drop indexes and recreate them.
53. what is sequence generator transformation?
Ans :It generates sequence ids ( system generated values ). We get these values from Nextval column and Currval column contains the repository value.
54. How did you schedule your informatica jobs?
Ans : Either by writing all workflows in perl/shell scripts and schedule those scripts or schedule the workflow jobs directly.
55. How many batch jobs you use to run in a day?
Ans : around 200-300 batch jobs.
56. How much time each job used to take to run?
Ans : Depends, some use to take 1 hour and some use to take 2-3 minutes.. depends on the job and complexity.
Nice article. Great work.
ReplyDeleteFor informatica scenarios: informatica scenarios
Very excellent post for more post please visit below for information
ReplyDeleteInformatica Online Training
This comment has been removed by the author.
ReplyDeleteNice Explanation on Informatica online training
ReplyDeleteInformative blog! it was very useful for me.Thanks for sharing. Do share more ideas regularly.
ReplyDeleteBest Spoken English Class in Chennai
Spoken English Course in Chennai
Best Spoken English Classes in Chennai
Spoken English Training near me
Best Spoken English Institute in Chennai
English Coaching Class in Chennai
English Courses in Chennai
Thanks for sharing a worthy information. This is really helpful. Keep doing more.
ReplyDeleteArticle submission sites
Education
Thanks for sharing this valuable information to our vision. You have posted a worthy blog keep sharing.
ReplyDeleteHadoop Course in Chennai
Best Big Data Training in chennai
Hadoop training Institutes in chennai
Hadoop Big data training in chennai
best hadoop training in bangalore
bigdata and hadoop training in bangalore
big data training institutes in bangalore
Such a great article! This is very useful. Thanks for sharing this with us.
ReplyDeleteVMware Training in Chennai
VMware Course in Chennai
VMware Learning
Mobile Testing Course in Chennai
Tally Course in Chennai
oracle Apps DBA Training in chennai
Unix Training in Chennai
the article is good.the information is well and useful for all.i want this type of article.thanks.
ReplyDeleteAWS course in Chennai
Data Science Course in Chennai
Big Data Analytics Training in Chennai
DevOps Training in Chennai
DevOps course in Chennai
AWS Training in OMR
AWS Training in Porur
Amazing Post. Looking for this kind of information for a long time. Thanks for Posting.
ReplyDeleteInformatica Training in Chennai
Informatica Training Center Chennai
Informatica Training chennai
Informatica Training institutes in Chennai
Informatica Training in Adyar
Informatica Training in Velachery
Amazing content. Extra-ordinary piece of work. Waiting for more updates.
ReplyDeleteBlockchain certification
Blockchain course
Blockchain Training
Blockchain Training in Chennai
Blockchain course in Chennai
Blockchain Training in Velachery
Blockchain Training in Tambaram
Blockchain Training in Adyar
QuickBooks has made payroll management quite definitely easier for accounting professionals. There are plenty individuals who are giving positive feedback QuickBooks Payroll Support Phone Number they process payroll either QB desktop and online options.
ReplyDeleteA team of QuickBooks Support Number dedicated professionals is invariably accessible to suit your needs so as to arranged all of your problems in an endeavor that you’ll be able to do your projects whilst not hampering the productivity.
ReplyDeleteThe Tabs Of Different Files While Managing Accounts Thus They Find Some Hurdles, That'll Be Not A Worrisome Situation. Quickbook Enterprise Support Number Helps A Person To Have Assistance In Real Time. Although QuickBooks Enterprise Technical Support Number Is A Toll-Free Number, Which Can Be Dialed.
ReplyDeleteThough QuickBooks Payroll Support Number a good tool for QuickBooks users in the commercial realm, yet some hits and misses may arise anytime during use. Every one of these issues become a good inconvenience to users causing a collection back again to the highly reputed nature of QuickBooks accounting software.
ReplyDeleteQuickBooks Payroll Technical Support Phone Number provides 24/7 make it possible to our customer. Only you need to do is make an individual call at our toll-free QuickBooks Payroll tech support number . You could get resolve all the major issues include installations problem, data access issue, printing related issue, software setup, server not responding error etc with this QuickBooks payroll support team.
ReplyDeleteThe process is quite simple to contact them. First, you must sign in to your organization. There clearly was a help button at the very top right corner. You are able to click and inquire any question about QuickBooks accounting software. You may also contact our US customer support team using QuickBooks Technical Support Number.
ReplyDeleteAs QuickBooks Tech Support Number Premier has various industry versions such as retail, manufacturing & wholesale, general contractor, general business, Non-profit & Professional Services, there clearly was innumerous errors which will make your task quite troublesome.
ReplyDeletethanks for sharing this post
ReplyDeletedata Science training in chennai
aws training center in chennai
aws training in chennai
aws training institute in chennai
best devops training in chennai
devops training in chennai
best java training in chennai
Welcome aboard, to our support site par excellence where all your worries related to the functioning of QuickBooks Enterprise will undoubtedly be addressed by our world-class team of QuickBooks Enterprise Support Number US into the blink of a watch. If you're experiencing any hiccups in running the Enterprise version of the QuickBooks software for your needs, it is best not to waste another second in trying to find a remedy for the problems.
ReplyDeleteAnd also to offer these services on a round-the-clock basis to any or all QB Enterprise users, we have QuickBooks Enterprise Support Phone Numbe toll-free in place, to offer all QB Enterprise users excellent support for many their glitches and address all their issues in a jiffy.
ReplyDelete
ReplyDeleteThe guide may have helped you understand QuickBooks Support Phone Number file corruption and ways to resolve it accordingly. If you wish gain more knowledge on file corruption or any other accounting issues, then we welcome you at our professional support center.
Concerning easy, is it possible to start supposing like lack of usefulness and flexibility yet this is to ensure that Intuit QuickBooks Support Number has emphasize wealthy accounting programming? Thus, this item package can without much stretch handle the demands of growing associations.
ReplyDeleteNo matter if you're getting performance errors or you are facing any type of trouble to upgrade your software to its latest version, you are able to quickly get help with QuickBooks Support.
ReplyDeleteQuickBooks Tech Support Number is accounting software, which will be a cloud-based application produced by Inuit Inc. As a matter of fact, the program has been developed with all the intention of keeping a secure record of financial needs for the business. Additionally, it is a user-friendly accounting software; an easy task to maintain; assisting the business to keep the records of financial transactions, and many other things features.
ReplyDeleteIntuit QuickBooks Support Number helps to make the process far more convenient and hassle free by solving your any QuickBooks issues and error in only an individual call.
ReplyDeleteOur support team is involved with pre-research to create themselves prepared in advance for the possible errors of QuickBooks Support Number. This practice helps them produce you the specified end up in the given time window.
ReplyDeleteIntuit is perhaps all concerning User expertise which explains why they need creating dedicated QuickBooks Support Phone Number Phone variety; Users will dial the fee number just in case they will have any facilitate in regards to the code.
ReplyDeleteQuickBooks Enterprise Support Number here to make tech support team to users. In September 2005, QuickBooks acquired 74% share related to market in the united states. A June 19, 2008 Intuit Press Announcement said that at the time of March 2008, QuickBooks’ share of retail units inside the industry accounting group touched 94.2 percent, based on NPD Group.
ReplyDeleteWith exceptional features, QuickBook helps most of the kinds of businesses with generating accounting reports, entries for every single sale, transactions pertaining to banking, etc., with a lot of ease. And along side support for QuickBooks, it really is much simpler to undertake all of the tools of QuickBooks in a hassle-free manner. Below is a listing of several QuickBooks errors that one may meet with while you are deploying it. Have a glimpse at it quickly.
ReplyDelete
ReplyDeleteQuickBooks Payroll Support Phone Number is an end to finish business, advanced competitive accounting software. But because it's a premium software with several advanced features, taking support for the software is a significantly better choice to run this impressive software without the technical issue.
ReplyDeleteQuickBooks Support Number get you one-demand technical help for QuickBooks. QuickBooks allows a number of third-party software integration.